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OR LIGAND BINDING DOMAIN 
(57) Abstract 



T.e present invention relates to novel fusion P^-^rof^^^^^ '^fof 
their use in the genetic engineering of cells. Key °" P"'''^ °/ 'i^^ .^^^^^^^ that contain' it to form multimers ("bundles") 

which being a bundling domain. Bundling <i°">ams "K u e ^ f „f domain. Examples of bundling domams 
through protein-protein interactions with each °'her or with oto proteins coma g t .^^^.^^ p53 ,e,ramer.zat,on 

that L be used in the practice of this 7'"''°" ■"^'"^^ ^^^^tch re^in obs'et^able bundling activity. Cells are engineered by U,e 
domain, a leucine zipper domain, and domains denved therefrom whicn m^i ^^^^^.^ constnicts. to render 

^ r . forrr^f of-n(» rranscriotion. 



them capaoie oi tiganu-u(-K>-">j>-"'- --c * 

(positively, or in some cases, negatively) target gene transcription. 
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FUSION PROTELNS COMPRISING A DIMERlZAllON. TTIIMERIZATION OR;rei^AM^i^-^_nO^ 

ADOmONAL HETEROLOGOUS TR^VNSCRIFTION ACTTVATION. TOANSCRIPTION REPRESSION, DNA BINDING 

OR LIGAND BIN'DING DOMAIN 

Background of the Invention 

Activation of transcription of a eukaryotic gene involves the interaction of a variety 
of proteins to form a complex that is recruited to the gene through protein:DNA interactions. 
Key protein domains on one or more of the components include transcription activat.on 
domains and DNA binding domains. Elucidating the mechanism of transcription, identifying 
and characterizing components of the transcriptional machinery and in some cases 
15 harnessing some of those components have been the subject of extensive research. { 
See e g Brent and Ptashne. 1 985; Hope and Struhi, 1 9B6; Keegan et al. 1 986., F.eids 
and Song. 1989; Spencer et al, 1993, Belshaw et al. 1996 and Rivera et al. 1996)(A 
Bibliography is provided just prior to the Examples, below.) 

Transcription activation domains are thought to function by recruiting a number of 
20 proteins with specific functions to the promoter (Lin and Green, 1 991 ; Goodrich et al, 1 993; 
Orphanides et al. 1996 and references cited therein; Rashne and Gann, 1997 and 
references cited therein). Among the large number of activation domains that have been 
characterized to date, the acidic-activation domain of the Herpes Simplex virus encoded 
protein VP16. is considered to be a very strong inducer of transcription and is w.dely used 
.5 in biological research (Sadowski et al, 1988. Ptashne and Gann. 1 997). The transcription 
activation domain of the p65 subunit of the human transcription factor NF-kB ,s also a very 
potent stimulator of gene expression, and in certain contexts can induce transcnpt.on more 
strongly than VP16 (Schmitz and Baeuerie. 1991; Ballard et al, 1992; Moore at al, 1993, 
Blair et al. 1994; Natesan et al, 1997). Both the VP16 and p65 activation domains are 
30 thought to function by interacting with and recrui^ng a number of proteins to the promoter 
(Cress and Triezenberg, 1990; Scmitzat al, 1994; Uesugi et al, 1997). 

One of the remarkable features of such activation domains is that 'lusing them to 
heterologous protein domains seldom affects their ability to activate transcnpt.on when 
recruited to a wide variety of promoters. The high degree of functional independence 
35 exhibited by these activation domains makes them valuable tools in various biological 

assays for analyzing gene expression and protein-protein or protein-RNA or protein-small 
molecule drug interactions (Fields and Song, 1989; Senguptha et al. 1996; Rivera et al, 
1996; Triezenberg, 1995 and references cited therein). The ability to activate gene 
expression strongly and when recruited to a wide range of promoters makes both p65 and 
40 VP1 6 attractive candidates for activation of gene transcription in gene therapy and other 
applications. However, even more potent activation domains, if available, would be useful 
for achieving higher levels of transcription on a per cell basis, and for improving the 
efficiency of the many biological assays that rely upon activation of transcnption of a 
reporter gene. 
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Several strategies to improve the potency of activation domains and therebyjhe 
expression of genes under their control have been reported (Emami and Carey. 1992; 
Gerberatal 1994; Ohashi et al, 1994; Blair at al, 1996; Tanaka et al, 1996). These 
approaches generally involve increasing the number of copies of activation domains fused 
5 to the DNA binding domain or generating activators containing synergizing combinations of 
,ctivation domains. Although some activators generated by these methods have been 
shown to be more potent, a number of limitations preclude their widespread application. 
First potent activators comprising reiterated activation domains do not increase the 
absolute levels of reporter gene expression when tested on promoters with multiple 
,0 binding sites for the activator (Emami and Carey, 1992). Second, a number of synergistic 
combinations of activation domains reported in the literature involve weak activation 
domains and the absolute levels of gene expression induced by these synergizing 
activation domains are much lower compared to potent acidic activation domains f rom VP1 6 
or p65 (Gerber at al, 1994; Tanaka et al, 1996). Third, it is not known whether any of these 
15 potent activation domains are capable of inducing gene transcription strongly when they 
are non-covalently linked to the DNA binding domain. Fourth, many potent activators 
containing multiple copies of VP16 or other acidic activators are highly toxic and/or 
accumulate to only low levels in the cell. 

As mentioned at the outset, a variety of important applications involving gene 
20 transcription require or would benefit from higher levels of gene expression. As noted 
above however, efforts to improve the potency of activation domains have been 
disappointing. Moreover, expression of various transcription activators revealed that 
observed levels of more potent activators, such as the p65 unit of NF-kB, are lower than 
expected Without wishing to be bound by any one theoiy. we suggest that the more 
25 potent the activation domain, the more toxic it is to the cell, the more disfavored is rts 
expression and/or the less of it is observed to accumulate in cells. How. then, is it 
possible to increase levels of heterologous gene expression? Remarkably, we have 
found that it is still possible to outmaneuver these facts of nature to improve heterologous 
gene expression and have in fact done so using the principles of "bundling", the 
30 engineering of the transcription activation domain, and combinations thereof, as described 
below. 

Summary of the Invention 

This document discloses new improvements in the design and delivery of 
35 transcription activation domains and provides improved materials and methods for 

regulating the transcription of a target gene. Aspects of the invention are applicable to 
systems involving either covalent or non-covalent linking of the transcription activation 
domain to a DNA binding domain. 

Key features of the invention include "bundling" domains, fusion proteins containing 
40 them, recombinant nucleic acids encoding such fusion proteins, systems involving bundles 
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of such fusion proteins, and other materials and methods involving such bundling domains. 
Key fusion proteins of the invention contain at least two mutually heterologous domains, 
one of which being a bundling domain. An important design concept is that the fusion 
proteins do not need to act alone, instead, they find and bind to each other (or with other 

5 proteins containing the bundling domain) to form a posse to accomplish their mission. In 
practice, cells are engineered by the introduction of recombinant nucleic acids encoding the 
fusion proteins, and in some cases v^ith additional nucleic acid constructs, to render them 
capable of ligand-dependent regulation of transcription of a target gene. Administration of 
the ligand to the ceils then regulates (positively, or in some cases, negatively) target gene 

10 transcription. 

Detailed information conceming bundling domains, guidance on their use and 
illustrative examples are provided below. Generally speaking, bundling domains include 
any domain that induces proteins that contain it to form muitimers ("bundles") through 
protein-protein interactions with each other or with other proteins containing the bundling 
15 domain. Examples of bundling domains that can be used in the practice of this invention 
include domains such as the lac repressor tetramerization domain, the p53 tetramerization 
domain, a leucine zipper domain, and domains derived therefrom which retain observable 
bundling activity. Proteins containing a bundling domain are capable of complexing with 
one another to forni a bundle of the individual protein molecules. Such bundling is 
20 "constitutive" in the sense that it does not require the presence of a cross-linking agent 

(i.e.. a cross-linking agent which doesn't itself contain a proteinaceous bundling domain) to 
link the protein molecules- 
Illustrative (non-limiting) examples of heterologous domains which can be included 
along with a bundling domain in various fusion proteins of this invention include 
transcription regulatory domains (i.e., transcription activation domains such as a p65. VP16 
or AP domain; transcription potentiating or synergizing domains; or transcription repression 
domains such as an ssn-6mJP-1 domain or Kruppel family suppressor domain); a DNA 
binding domain such as a GAL4, lex A or a composite DNA binding domain such as a 
composite zinc finger domain or a ZFHD1 domain; or a ligand-binding domain comprising or 
derived from (a) an immunophilin, cyclophilin or FRB domain; (b) an antibiotic binding 
domain such as tetR: or (c) a hormone receptor such as a progesterone receptor or 
ecdysone receptor. 

A wide variety of ligand binding domains may be used in this invention, although 
ligand binding domains which bind to a cell permeant ligand are preferred. It is also 
preferred that the ligand have a molecular weight under about 5kD. more preferably below 
2.5 kD and optimally below about 1500 D. Non-proteinaceous ligands are also preferred. 
Ligand binding domains include, for example, domains selected or derived from (a) an 
immunophilin (e.g. FKBP 12). cyclophilin or FRAP domain; (b) a hormone receptor such as 
a receptor for progesterone, ecdysone or another steroid; and (c) an antibiotic receptor 
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such as a tetR domain for binding to tetracycline, doxycycltne or other analogs or mimics 
thereof. 

Examples of ligand binding domain/ligand pairs that may be used in the practice of 
this invention include, but are not limited to: FKBP:FK1012, FKBP:synthetic divalent FKBP 
hgands (see WO 96/0609 and WO 97/31898), FRB:rapamycin/FKBP (see e.g.. WO 
96/41865 and Rivera et al, "A humanized system for pharmacologic control of gene 
expression". Nature Medicine 2(9):1 028-1032 (1997)), cyciophilin:cycIosporin (see e.g. 
WO 94/18317), DHFR:methotrexate (see e.g. Licitra et ai, 1996, Proc. Natl. Acad. Sci. USA 
93:12817-12821), TetR:tetracycline or doxycycline or other analogs or mimics thereof 
(Gossen and Bujard, 1992, Proc. Natl. Acad. Sci. U.S.A. 89:5547; Gossen et al, 1995, 
Science 268:1766-1769; Kistner et al, 1996, Proc. Natl. Acad. Sci. USA 93:10933-10938), a 
progesterone receptor:RU486 (Wang et al, 1994, Proc. Natl. Acad. Sci. USA 
91 :81 80-81 84), ecdysone receptor ecdysone or muristerone A or other analogs or mimics 
thereof (No et al, 1996, Proc. Natl. Acad.. Sci. USA 93:3346-3351) and DNA 
gyraseicoumermycin (see e.g. Farrar et ai, 1996, Nature 383:178-181). 

A wide variety of DNA binding domains may be used in the practice of this 
invention, including a domain selected or derived from a GAL4, lexA or composite (e.g. 
ZFHD1) DNA binding donnain, or a DNA binding domain, e.g., in combination with ligand 
binding domains such as a wt or mutated progesterone receptor domain. TetR domains, 
v^hich provide both DNA binding and ligand binding functions, are discussed in the context 
of ligand binding domains. In many applications it is preferable to use a DNA binding 
domain which is heterologous to the cells to be engineered. Heterologous DNA binding 
domains include those which occur naturally in cell types other than the cells to be 
engineered as well as composite DNA binding domains containing component portions 
which are not found in the same continuous polypeptide or gene in nature, at least not in 
the same order or orientation or with the same spacing present in the composite domain. In 
the case of composite DNA binding domains, component peptide portions which are 
endogenous to the cells or organism to be engineered are generally preferred. 

In the case of the chimeric transcription factors containing a tetR domain, the DNA 
binding domain is provided by the tetR component, and is by its nature heterologous to 
eukaryotic ceils. TetR domains are discussed in further detail in the context of ligand binding 
domains. 

In embodiments in which an endogenous gene is to be regulatably expressed, a 
composite DNA binding domain which is selected for recognition of one or more sequences 
upstream of the target gene may be deployed. 

Additional information concerning DNA binding domains is provided below. 

in an important application of this invention, two or more of the fusion proteins in the 
bundle each comprise, in addition to the bundling domain, at least one transcription 
activation domiain which is heterologous to the bundling donnain. Bundling of proteins 
containing transcription activation domains can significantly increase their effective potency 
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(relative to a single such fusion protein lacking a bundling domain) and consequently leads 
to strong induction of gene expression. Unlike their counterparts lacking a bundling domain, 
fusion proteins containing a bundling domain are designed to achieve effective local 
concentrations of transcription activation domains and to robustly induce gene expression 
5 when recruited en masse to an expression control sequencfr-even despite relatively low 
overall levels of expression or accumulation of ttie fusion proteins. Highly potent bundled 
activation domains can also be used in a wide variety of assays having transcnptional 
read outs. Such assays include assays for identifying protein-protein interactions (or 
inhibitors thereof) in a eukaryotic, preferably mammalian, tVv'o-hybrid assay or variant 
10 thereof, e.g., three-hybrid assay, reverse two-hybrid assay, etc. 

Bundling domains may be introduced into the design of fusion proteins of a variety 
of regulated gene expression systems, including both allostery-based systems such as 
those regulated by tetracycline, RU486 or ecdysone. or analogs or mimics thereof, and 
dimerization-based systems such as those regulated by divalent compounds like FK1012, 
15 FKCsA, rapamycin, AP1510 or coumemnycin, or analogs or mimics thereof, all as described 
below (See also. Clackson, 1997, Controlling mammalian gene expression with small 
molecules, Current Opinion in Chem. Biol. 1 210-218). The fusion proteins may comprise 
any combination of relevant components, including bundling domains, DNA binding 
domains, transcription activation (or repression) donnains and ligand binding domains. 
20 Other heterologous domains may also be included. 

Various embodiments of this invention involve fusion proteins which contain at 
least one bundling domain, DNA binding domain and transcription activation domain; at 
least one bundling domain, ligand binding domain and transcription repression domain; at 
least one bundling domain, ligand binding domain and DNA binding domain; at least one 
25 bundling domain, ligand binding domain, DNA binding domain and transcription activation 
domain; and, preferably, at least one bundling domain, ligand binding domain and 
transcription activation domain. In currently preferred embodiments, these fusion proteins 
represent improvements on the type described in W094/18317 and W096/41865, 
wherein the ligand binding domain is or is derived from a cyclophilin, immunophilin (e.g. an 
30 FKBP domain) or FRB domairv- although, any ligand binding domain may be used in the 
chimeric proteins, and the regulatory mechanism can be dimerization- or allostery-based, 
A preferred fusion protein contains a lac repressor tetramerization domain, an FRB 
domain and a transcription activation domain derived from the activation domain of human 
p65. It should be appreciated that in any of the embodiments of this invention involving a 
35 fusion protein containing at least one transcription activation domain derived from p65, 

whether with or without a bundling domain, the p65 peptide sequence may be a naturally 
occurring p65 sequence or may be engineered as described below. 

Another aspect of this invention involves improvements in the transcription 
activation domain rtself . In this regard, recombinant nucleic acids are provided which encode 
tusion proteins containing a transcription activation domain and at least one additional 



40 



5 



wo 99/10510 



PCT/US98/1772J 



domain that is heterologous thereto, where the peptide sequence of the activation domain 
is itself modified relative to the naturally occurring sequence from which it was derived to 
increase or decrease its potency as a transcriptional activator relative to the ccunlerpar. 
comprising the native peptide sequence. Certain embodiments of this invention involve 

5 fusion proteins containing a transcription activation domain derived from p65 and bearing 
one or more of the mutations shown in Figure 7. Fusion proteins containing one or more 
modified activation domati^s can also contain a bundling domain to further increase their 
efficacy as transcriptional activators, and/or one or more additional domains such as a 
ligand binding domain, DNA binding domain or transcription activation synergtzing domain, 

10 such as are noted above and as discussed below. 

The invention thus provides recombinant nucleic acid constructs which encode the 
various proteins of this invention or are othen^vise useful for practicing it, various DNA 
vectors containing those constructs for use in transducing prokaryotic and eukaryotic cells, 
cells transduced with the recombinant nucleic acids, fusion proteins encoded by the above 

1 5 recombinant nucleic acids, and target gene constojcts. 

Also provided are nucleic acid compositions comprising two or more recombinant 
nucleic acids which, v^/hen present within a cell, pemiit transcription of a target gene, 
preferably following exposure to a cell permeant ligand. These compositions are illustrated 
as follows: 

20 Composition #1 . A first such composition comprises a recombinant nucleic acid 

encoding a fusion protein comprising at least one ligand binding domain, bundling domain 
and transcription activation domain; a second recombinant nucleic acid encoding a fusion 
protein comprising a DNA binding domain and at least one ligand binding domain; and an 
optional third recombinant nucleic acid comprising a target gene (or cloning site) operatively 

25 linked to an expression control sequence including a DNA sequence recognized by the 

DNA binding domain mentioned above. Such compositions are illustrated by embodiments 
in which the ligand binding domains are or are derived from immunophilin, cyclophilin or FRB 
domains; the transcription activation domain is or is derived from an activation domain such 
as a VP1 6 or p65 domain; and the bundling domain is or is derived from a iac repressor 

30 tetramerization donnain. 

Composition #2. Another such composition is similar to Composition ^1 except 
that the fusion protein encoded by the first recombinant nucleic acid comprises at least one 
ligand binding domain, bundling domain and DNA binding domain, and the fusion protein 
encoded by the second recombinant nucleic acid comprises a transcription activation 

35 domain and at least one ligand binding domain. 

Composition #3. Another such composition comprises a recombinant nucleic acid 
encoding a fusion protein comprising at least one iigand binding domain, bundling domain 
and transcription activation domain; a second recombinant nucleic acid encoding a protein 
comprising a DNA binding domain; and an optional third recombinant nucleic acid comprising 

40 a target gene (or cloning site) operatively linked to an expression control sequence 
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including a DNA sequence recognized by the DNA binding domain nnentioned above. Such 
compositions are illustrated by embodiments in which the ligand binding domains are or are 
derived from a receptor domain such as an ecdysone receptor; the DNA binding domain is 
or IS derived from a DNA binding domain such as an RXR protein, chosen for its ability to 

5 bind lo the receptor domain in the presence of a ligand for that receptor; the transcription 
activation domain is or is derived from an activation domain such as a VP16 or p55 domain; 
and the bundling domain is or is derived from a lac repressor tetramerization domain. 

Composition #4. Another such composition comprises a recombinant nucleic acid 
encoding a fusion protein comprising at least one ligand binding domain, DNA binding 

10 domain, bundling domain and transcription activation domain (where the ligand binding 

ciorniain and DNA binding domain may be part of or derived from the same domain); and an 
optional second recombinant nucleic acid comprising a target gene (or cloning site) 
operativety linked to an expression control sequence including a DNA sequence 
recognized by the DNA binding domain mentioned above. Such compositions are 

15 illustrated by embodiments in which the ligand binding and DNA binding domains are or are 
derived from a receptor domain such as a tetracycline receptor which is capable of binding 
to a characteristic DNA sequence in the presence of tetracycline or another ligand for the 
receptor; the transcription activation domain is or is derived from an activation domain such 
as a VP16 or p65 domain; and the bundling domain is or is derived from a lac repressor 

20 tetramerization domain. Such compositions are further illustrated by embodiments in which 
the ligand binding domain is or is derived from a receptor domain such as a progesterone 
receptor which is capable of binding to progesterone or analogs or mimics thereof, including 
RU486; the DNA binding domain is or is derived from a GAL4 or composite DNA binding 
domain; the transcription activation domain is or is derived from an activation domain such 

25 as a VP16 or p65 domain; and the bundling domain is or is derived from a lac repressor 
tetramerization domain. 

Composition #5. Another such composition, which unlike Compositions 1 - 4 is 
designed for constitutive expression rather than for iigand-mediated regulation of 
transcription, comprises a recombinant nucleic acid encoding a fusion protein comprising at 

30 least one DNA binding domain, bundling donnain and transcription activation domain; and a 
second recombinant nucleic acid comprising a target gene (or cloning site) operatively 
linked to an expression control sequence including a DNA sequence recognized by the 
DNA binding domain mentioned above. Such compositions are illustrated by embodiments 
in which the transcription activation domain is or is derived from an activation domain such 

35 as a VP1 6 or p65 domain; the DNA binding domain is or is derived from a GAL4 or 
composite DNA binding domain; and the bundling domain is or is derived from a lac 
repressor tetramerization domain. 

Compositions 1,3,4 and 5 may further comprise an additional recombinant nucleic 
acid encoding a fusion protein comprising a bundling domain and at least one transcription 
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activation domain or transcription synergizing donnain, with or without one or more optional 
additional domains. 

Each of the recombinant nucleic acids of this invention may iunher comprise an 
expression control sequence operably linked to the coding sequence and may be provided 

5 within a DNA vector, e.g., for use in transducing prokaryotic or eukan/oiic cells. Some or ail 
of the recombinant nucleic acids of a given composition as described above, including any 
optional recombinant nucleic acids, may be present within a single vector or may be 
apportioned between two or more vectors. In certain embodiments, the vector or vectors 
are viral vectors useful for producing recombinant viruses containing one or more of the 

10 recombinant nucleic acids. The recombinant nucleic acids may be provided as inserts 

within one or more recombinant viruses which may be used, for example, to transduce cells 
in vitro or celts present within an organism, including a human or non-human mammalian 
subject. For example, the recombinant nucleic acids of any of Compositions 1 - 5, including 
any optional recombinant nucleic acids, may be present within a single recombinant virus 

15 or within a set of recombinant viruses, each of which containing one or more of the set of 
recombinant nucleic acids. Viruses useful for such embodiments include any virus useful for 
gene transfer, including adenoviruses, adeno-associated viruses (AAV), retroviruses, 
hybrid adenovinjs-AAV, herpes viruses, lenti viruses, etc. In specific embodiments, the 
recombinant nucleic acid comprising the target gene is present in a first virus and one or 

20 more or the recombinant nucleic acids encoding the transcription regulator/ protein(s) are 
present in one or more additional viruses. In such multiviral embodiments, a recombinant 
nucleic acid encoding a fusion protein comprising a bundling domain and a transcription 
activation domain, and optionaliy, a iigand binding domain, may be provided in the same 
recombinant vims as the target gene construct, or alternatively, on a third virus. It should 

25 be appreciated that non-viral approaches (naked DNA, liposomes or other lipid 

compositions, etc.) may be used to deliver recombinant nucleic acids of this invention to 
cells in a recipient organism. 

The invention also provides methods for rendering a cell capable of regulated 
expression of a target gene which involves introducing into the cell one or more of the 

30 recombinant nucleic acids of this invention to yield engineered ceils which can express the 
appropriate fusion protein(s) of this invention to regulate transcription of a target gene. The 
recombinant nucleic acid(s) may be introduced in viral or other form into cells maintained in 
vitro or into cells present v/ithin an organism. The resultant engineered cells and their 
progeny containing one or more of these recombinant nucleic acids or nucleic acid 

35 compositions of this invention may be used in a variety of important applications 

discussed elsewhere, including human gene therapy, analogous veterinary applications, 
the creation of cellular or animal models (including transgenic applications) and assay 
applications. Such ceils are useful, for example, in methods involving the addition of a 
ligand, preferably a cell penmeant ligand, to the cells (or administration of the iigand to an 

40 organism containing the cells) to regulate expression of a target gene. Particularly important 
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animal models include rodent (especially mouse and rat) and non-human primate models. In 
gene therapy applications, the cells wilt generally be human and the peptide sequence of 
each of the various domains present in the fusion proteins (with the possible exception of 
the bundling domain) will preferably be, or be derived from, a peptide sequence of human 
origin. 

In certain assay applications, recombinant nucleic acids are designed as described 
for Composition #1 , except that the ligand binding domains of the fusion proteins are 
replaced with protein domains that are known to bind to each other. Cells transduced with 
these recombinant nucleic acids and with a matched target gene construct express a target 
gene typically selected for convenience of measurement of expression level. These celts 
can be used to identify the presence of a substance which blocks the interaction of the two 
protein domains which are known to interact. 

In other 2-hybrid-type applications aimed at the identification of genes encoding 
proteins which interact with a protein or protein domain of interest, cells are transduced with 
similar recombinant nucleic acids as described immediately above, except that a library of 
test nucleic acid sequences of potential interest is cloned into one of the recombinant 
nucleic acids encoding one of the fusion proteins. A 2-hybrid style assay is conducted in 
which transcription of the target gene indicates the presence of a test nucleic acid sequence 
which encodes a domain that interacts with the protein domain in the cognate fusion 
protein. 

Reverse 2-hybrid-type assays may be conducted analogously using cells 
engineered to positively or negatively regulate expression of a reporter gene as a result of 
"2-hybrid" formation. The cells are exposed to one or more test substances, and inhibition 
of regulation of expression is taken as an indication of possible inhibition of the 2-hybrid 
formation. 

Brief Description of the Figures 

Abbreviations used in the Figures: 

G = yeast GAL4 DNA binding domain, amino acids 1-94 

F = human FKBP12, amino acids 1-107 

R = FRB domain of human FRAP, amino acids 2025-2113 

S = activation domain from the p65 subunit of human NF-kB, amino acids 361-550 
V n activation domain from Herpesvirus VP16, amino acids 410-494 
L = E. coli lactose repressor, amino acids 46-360 

MT = Minimal Tetramerization ("bundling") domain of E. coli lactose repressor, amino acids 324- 
360 

FIG. 1 Diagram comparing various fusion proteins, with and without bundling domains, and 
their use in various strategies for delivery of activation domains to the promoter of a target 
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,ene (A) Kvo fusion proteins, one containing a DNA binding domain (e.g. a GAU or 
ZFHD DNA b,nding doma.n) fused to an FKBP12, and the other conta,n,ng a p65 
ac,ivat,on dor.a,n fused to an FRB, are expressed ,n cells. Addition of rapamycn eads to 
Te recruitment of a s.nge act,vat,on doma.n to each DNA hind.ng doma.n mono, er. B 
Fusion of multiple FKBPs to the DNA binding domain allows rapamycn to recruit mulfple 
ration domains to each DNA binding domain monomer. (C) Addition o, the lactose 
repressor tetramenzation domain to the FRB-activation domain fusion allows rapamycn to 
recruit four activation domains ,o each FKBP fused to the DNA binding domain D) 
RaDamvcn recruits bundled activation domain fusion protein to each of the FKBP-DNA 
S dial fusion proteins. (E) and (F) illustrate a mutated tetR-based system without 
and wUh bundling. (G) and (H) illustrate an engineered progesterone-R-based system, 
without and with bundling. 

FIG 2 Expression levels of the stably integrated reporter gene correlate with the number 
of activa^on domains r^ru.ted to the promoter. The indicated DNA binding dornain and 
acLion domain fusions were transfected into HT1080B cells containing a stably 
rjated SEAP reporter. Mean values of SEAP activity secreted into the medium following 
Tdlon ono nM raW" are shown (./- S.C.). In all cases. SEAP -P-^^ - 
are plotted for cultures receiving 100 ng of activation domain «>^P^«^,^'°" Pf 
gives peak expression values in transiently transfected cells and slightly below peak 
levels in the stably translected cell line. 

FIG 3 Svnerg, beMeen the activation domains in the RLS bundle is the piiman, cause 

fthe s!; - Ltion o, the repo^er gene express^n. a, schematic ^^^^^^^^^^^^ 
lp„s;ono,,h,p,otei„bundleso,RUS-increas,ngconcenUa.,^d^^ 
LS or L in the cell, b) Twenty nanograms ol GF1 encoding plasmid vras co-tran fectad with 
, 00 no ol RLS alone or with indicated ccncenttations ol LS or L regions. The cells were 
s l ated with ,0 nM rapantycin and the SEAP act»ity in the meditim was measur^^d 1 8 
L a«e, translecoon. Mean values 0, SEAP activity secreted into the J *»"^ 
addition ol rapamycih are shown S.D.). c) Western blot analyses using 12CA5 
Inldy against hemagglutinin epitope o. various recombinant proteins expressed ,n the 
transfected cells is shown. 

FIG 4 A thirty-six amino acd region in the carboxy terminal of the lactose repressor 
s protein is sufficent for generating highly potent and bundled activation dom-n fusion 
proteins HTf 080 B celfs were co-transfected with 20 ng GF1 and 100 ng of indicated 
activation doma.n containing plasmid vectors. Transcription of the -Porter gerte was 
timulated by the addition of 1 0 nM rapamycin in the medium. Mean values o SEAP 
activity seer ted into the medium assayed 24 hrs after transfeCon are shown (./- S.D.) 



40 



10 



wo 99/10510 



PC'r/U.S9K/17723 



FIG 5 Tethering bundled activation domain fusion proteins to DNA binding proteins 
significantly reduces the amount of reconstituted activators required to strongly stimulate 
the target gene expression, a) Twenty nanograms of GF4 and indicated concentrations of 
act.vat.on domain expressing plasmids were transfected into HT1080 B cells. Transcription 
of the stably integrated reporter gene was induced by the addition of 10 nM rapamycin in 
the medium b) Western blot analysis of the relative expression levels of the transfected 
transcnption factors, c) Twenty nanograms of GF4 and one hundred nanograms of the 
indicated activation domain fusion protein encoding plasmids were co-transfected into 
HT1080 B cells and the transcriptional activity of the GAL4 responsive reporter gene was 
induced by the addition of indicated concentrations of rapamycin in the medium. In all 
cases, mean values of SEAP activity secreted into the medium 24 hrs after the addition of 
rapamycin are shown (+/- S.D.), 

FIG 6 Bundling the target-activation domain fusion protein improves the sensitivity of the 
two-hybrid assay in mammalian cells. Diagram showing two-hybrid assay using bundled 
fusion protein containing the target and activation domains. GAL4 DNA binding domam 
fused to c-Cbl (GCBL) is shown interacting with its target protein SH3 fused to either a) 
VP1 6 activation domain (SH3S) or b) lactose repressor tetramerization domain-VPI 6 
activation domain sequences (SH3MTS). c) HT1080 B cells containing stably .ntegrated 
GAL4 responsive reporter gene were transfected with 1 00 ng of indicated expression 
plasmids. Mean values of SEAP activity secreted into the medium 24 hrs after transfecfon 
are shown (+/- S.D.). 

FIG 7 Mutations for the p65 transcription activation domain are listed, including: 

1 . Mutations that are intended to increase activation potency, including M1 , M2, M6.M7 

and M8. . . 

2. Mutations that are intended to slightly decrease activation potency, including M4 and 

M5. 



Detailed Description of the Invention 



Definitions . . 

For convenience, the intended meaning of certain terms and phrases used herein 

are provided below. 

"Activate" as applied to the expression or transcription of a gene denotes a 
directly or indirectly observable increase in the production of a gene product, e.g., an RNA 
or polypeptide encoded by the gene. 

"Capable of selectively hybridizing" means that two DNA molecules are 
susceptible to hybridization with one another, despite the presence of other DNA 
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molecules under hybridization conditions which can be chosen or readily determined 
empincallv by the practitioner of ordinary skill in this art. Such treatments include conditions 
of high sthngency such as washing extensively with buffers containing 0.2 to 6 x SSC 
and/or containing 0.1% to r/c SDS, at temperatures ranging from room temperature to 65- 
75=C. See for example F.M. Ausubel et al., Eds, Short Protocols in Molecular Biology, 
Units 6.3 and 6.4 (John Wiley and Sons, New York, 3d Edition, 1995). 

"Cells" "host ceils" or "recombinant host cells" refer not only to the particular 
cells under discussion, but also to their progeny or potential progeny. Because certain 
modifications mav occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still 
included within the scope of the term as used herein. 

•■Cell line" refers to a population of cells capable of continuous or prolonged 
growth and division in vitro. Often, cell lines are clonal populations derived from a single 
progenitor cell It is further known in the art that spontaneous or induced changes can occur 
in karyotype during storage or transfer of such clonal populations. Therefore, cells derived 
from the cell line referred to may not be precisely identical to the ancestral cells or cultures, 
and the cell line referred to includes such variants. 

"Composite", "fusion", and "recombinant" denote a material such as a nucleic 
acid nucleic acid seqiience or polypeptide which contains at least two constituent portions 
which are mutually heterologous in the sense that they are not othenA/ise found directly 
(covalently) linked in nature, i.e.. are not found in the same continuous polypeptide or gene 
in nature, at least not in the same order or orientation or with the same spacing present in 
the composite, fusion or recombinant product. Typically, such materials contain 
components derived from at least two different proteins or genes or from at least two non- 
adjacent portions of the same protein or gene. In general, "composite" refers to portions of 
different proteins or nucleic adds which are joined together to fomn a single functional unit, 
while "fusion" generally refers to two or more functional units which are linked together. 
"Recombinanf is generally used in the context of nucleic acids or nucleic acid sequences. 

"Cofactor" refers to proteins which either enhance or repress transcnption in a 
non-gene specific manner. C^ofactors typically lack intrinsic DNA binding specificity, and 
function as general effectors. Positively acting cofactors do not stimulate basal 
transcription, but enhance the response to an activator. Positively acting cofactors include 
PCI , PC2, PCS. PC4. and ACF. TAFs which interact directly with transcriptional activators 
are also referred to as cofactors. 

A "coding sequence" or a sequence which "encodes" a particular polypeptide or 
RNA is a nucleic acid sequence which is transcribed (in the case of DNA) and translated 
(in the case of mRNA) into a polypeptide in vitro or in vivo when placed under the control 
of an appropriate expression control sequence. The boundaries of the coding sequence 
are generally detemiined by a start codon at the 5' (amino) terminus and a translat^n stop 
codon at the 3' (cartx)xy) temiinus. A coding sequence can include, but is not limited to. 
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cDNA from procaryotic or eukaryotic mRNA. genomic DNA sequences from procaryotic or 
eukan/otic DNA, and synlhettc DNA sequences. A transcription termination sequence will 
usually be located 3' to the coding sequence. 

The term "conjoint", with respect to administration of two or more viruses, refers to 
the simultaneous, sequential or separate dosing of the individual virus provided that some 
overlap occurs in the simultaneous presence of the viruses in one or more cells of the 
animial. 

A "construct", e.g.. a "nucleic acid construcf or "DNA construct", refers to a 
nucleic acid or nucleic acid sequence. 

"Derived from" denotes a peptide or nucleotide sequence selected from within a 
given sequence. A peptide or nucleotide sequence derived from a named sequence may 
further contain a small number of modifications relative to the parent sequence, in most 
cases representing deletion, replacement or insertion of less than about 15%, preferably 
less than about 10%, and in many cases less than about 5%, of amino acid residues or 
bases present in the parent sequence. In the case of DNAs, one DNA molecule is also 
considered to be derived from another if the two are capable of selectively hybridizing to 
one another. Polypeptides or polypeptide sequences are also considered to be derived 
from a reference polypeptide or polypeptide sequence if any DNAs encoding the two 
polypeptides or sequences are capable of selectively hybridizing to one another. 
Typically, a derived peptide sequence will differ from a parent sequence by the 
replacement of up to 5 amino acids, in many cases up to 3 amino acids, and very often by 
0 or 1 amino acids. A derived nucleic acid sequence will differ from a parent sequence by 
the replacement of up to 15 bases, in many cases up to 9 bases, and very often by 0 - 3 
bases. In some cases the amino acid(s) or base(s) is/are added or deleted rather than 
replaced. 

"Domain" refers to a portion of a protein or polypeptide. In the art, the term 
"domain" may refer to a portion of a protein having a discrete secondary stmcture. 
However, as will be apparent from the context used herein, the term "domain" as used in 
this document does not necessariy connote a given secondary structure. Rather, a 
peptide sequence is referred to herein as a "domain" simply to denote a polypeptide 
sequence from a defined source, or having or conferring an intended or observed activity. 
Domains can be derived from naturally occurring proteins or may comprise non-naturally- 
occurring sequence. 

"DNA recognition sequence" means a DNA sequence which is capable of 
binding to one or more DNA-binding domains, e.g., of a transcription factor or an engineered 
polypeptide. 

"Expression control element", or simply "control element", refers to DNA 
sequences, such as initiation signals, enhancers, promoters and silencers, which induce or 
control transcription of DNA sequences with which they are operably linked. Control 
elements of a gene. may be located in introns, exons, coding regions, and 3* flanking 
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sequences. Some control elements are "tissue specific", i.e., affect expression of the 
selected DNA sequence preferentially in specific cells (e.g., cells of a specific tissue), wfiiie 
others are active in many or most cell types. Gene expression occurs preferentially in a 
specific cell if expression in this eel! type is observably higher than expression in other cell 
types. Control elements include so-called "leaky" promoters, which regulate expression of 
a selected DNA pnmarily in one tissue, but cause expression in other tissues as well. 
Furthermore, a control element can act constitutively or inducibly. An inducible promoter, for 
example, is demonstrably more active in response to a stimulus than in the absence of that 
stimulus. A stimulus can comprise a hormone, cytokine, hea^/y metai, phorboi ester, cyclic 
AMP (cAMP), retinoic acid or derivative thereof, etc. A nucleotide sequence containing one 
or more expression control elements may be referred to as an "expression control 
sequence". 

"Gene" refers to a nucleic acid molecule or sequence comprising an open reading 
frame and including at least one exon and (optionally) one or more intron sequences. 

"Genetically engineered cells" denotes cells which have been modified by the 
introduction of recombinant or heterologous nucleic acids (e.g. one or more DNA constructs 
or their RNA counterparts) and further includes the progeny of such cells which retain part 
or all of such genetic nDodrfication. 

"Heterologous", as it relates to nucleic acid or peptide sequences, denotes 
sequences that are not nonrally joined together, and/or are not normally associated with a 
particular cell Thus, a "heterologous" region of a nucleic acid construct is a segment of 
nucleic acid within or attached to another nucleic acid nroiecule that is not found in 
association with the other molecule in nature. For example, a heterologous region of a 
construct could include a coding sequence flanked by sequences not found in association 
with the coding sequence in nature. Another example of a heterologous coding sequence 
is a constnjct where the coding sequence itself is not found in nature (e.g., synthetic 
sequences having codons different from the native gene). Similarly, in the case of a cell 
transduced with a nucleic acid construct which is not normally present in the cell, the cell 
and the construct would be considered mutually heterologous for purposes of this 
Invention. Allelic variation or naturally occurring mutational events do not give rise to 
heterologous DNA, as used herein. 

"Initiator" refers to a short, weakly conserved element that encompasses the 
transcription start site and which is important for directing the synthesis of properiy initiated 
transcripts. 

"Interact" refers to directly or indirectly detectable interactions between molecules, 
such as can be detected using, for example, a yeast two hybrid assay or by 
immunoprecipitation. The term "interacf encompasses "binding" interactions between 
molecules, interactions may be, for example, protein-protein, protein-nucleic acid, protein- 
snnall molecule or small molecule-nucleic acid in nature. 
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"Minimal promoter" refers to the minimal expression control element that is 
capable of initiating transcription ol a selected DNA sequence to which it is operably linked. 
A minimal promoter frequently consists of a TATA box or TATA-like box. Numerous minimal 
promoter sequences are known in the literature. 

"Nucleic acid" refers to polynucleotides such as deoxyribonucleic acid (DNA), 
and where appropriate, ribonucleic acid (RNA). The term should also be understood to 
include derivatives, variants and analogs of either RNA or DNA made from nucleotide 
analogs, and, as applicable to the embodiment being described, single (sense or 
antisense) and double-stranded polynucleotides. 

"Operably linked" when referring to an expression control element and a coding 
sequence means that the expression control element is associated with the coding 
sequence in such a manner as to permi or facilitate transcription of the coding sequence. 

A "recombinant virus" is a virus particle in which the packaged nucleic acid 
contains a heterologous portion. 

"Protein", "polypeptide" and "peptide" are used interchangeably. 
A "target gene" is a nucleic acid of interest, the expression of which is modulated 
according to the methods of the invention. The target gene can be endogenous or 
exogenous and can integrate into a cell's genome, or remain episomal. The target gene can 
encode, for instance, a protein, an antisense RNA or a ribozyme. 

The terms "transcriptional activation unit" and "activation unit", refer to a 
peptide sequence which is capable of inducing or otherwise potentiating transcription 
activator^ependent transcription, either on its own or when linked covalently or non- 
covalently to another transcriptional activation unit. An activation unit may contain a minimal 
polypeptide sequence which retains the ability to interact directly or indirectly with a 
transcription factor. Unless otherwise dear from the context, where a fusion protein is 
referred to as -including" or "comprising" an activation unit, it will be understood that other 
portions of the protein from which the activation unit is derived can be included. 
Transcriptional activation units can be rich in certain amino acids. For example, a 
transcriptional activation unit can be a peptide rich in acidic residues, glutamine, proline, or 
serine and threonine residues. Other transcriptional activators can be rich in .soleucme or 
basic amino acid residues (see, e.g., Triezenberg (1995) Cur. Opin. Gen. Develop. 5:190, 
and references cited therein). For instance, an activation unit can be a peptide motif of at 
least about 6 amino acid residues associated with a transcription activation domain, 
including the well-known "acidic", "glutamine-rich" and "proline-rich" motifs such as the K1 3 
motif from p65, the OCT2 Q domain and the 0CT2 P domain, respectively. 

The term "transcriptional activator^' refers to a protein or protein complex, the 
presence of which can increase the level of gene transcription in a cell of a responsive 
gene It is thought that a transcriptional activator is capable of enhancing the efficiency 
with which the basal transcription complex perfomns. i.e.. activating transcription. Thus, as 
used herein, a transcriptional activator can be a single protein or alternatively ,t can be 
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composed of several un.ts at least some ol which are not covalently l.nked to each other. 
A transcriptional activator topically has a modular structure, i.e., comprises one or more 
component domains, such as a DMA binding domain and one or more transcriptional 
activation units or dor^ains. Transcriptional activators are a subset o( transcription factors, 
defined below. 

•'Transcription factor" refers to any protein whose presence or absence 
contributes to the initiation of transcription but which is not itself a part of the polymerase. 
Cer^>ain transcription factors stimulate transcription transcriptional activators"); other 
repress transcription rtranscnptional repressors"). Transcription factors are generally 
classifiable into two groups: (i) the general transonption factors, and (ii) the transcription 
activators Transcription factors usually contain one or more regulatory domains. Some 
transcnption factors contain a DNA binding domain, which is that part of the transcription 
factor which directly interacts with the expression control element of the target gene. 

"Transcription regulatory domain" denotes any domain which regulates 
transcription, and includes activation, synergizing and repression domains. The term 
"activation domain" denotes a domain, e.g. in a transcription factor, which positively 
regulates (increases) the rate of gene transcription. The tem. "repression domain" denotes 
a domain which negatively regulates (inhibits or decreases) the rate of gene transcnpfon. 

A " transcription synergizing domain" is defined as any domain which 
increases the potency of transcriptional activation when present along with the 
transcription activation domain. A synergizing domain can be an independent 
transcriptional activator, or alternatively, a domain which on its own does not induce (or 
does not usually induce) transcription but is able to potentiate the activity of a transcnption 
aclvln domain. The synergizing domain can be a component domain of a fusion protein 
containing the activation domain or can be recruited to the DNA binding domain or other 
component of the transcription complex, e.g.. via a bundling interaction. 

•Transfection" means the introduction of a naked nucleic acid molecule into a 
recipient cell. "Infection" refers to the process wherein a nucleic acid is introduced into a 
cell by a vims containing that nucleic acid. A "productive infection" refers to the process 
wherein a virus enters the cell, is replicated, and is then released from the cell (sometimes 
referred to as a "lytic" infection). "Transduction" encompasses the introduction of nucleic 
acid into cells by any means. ^ • 

"Transgene" refers to a nucleic acid sequence which has been introduced into a 
cell Daughter cells deriving from a cell in which a transgene has been introduced are also 
said to contain the transgene (unless it has been deleted). The polypeptide or RNA 
encoded by a transgene may be partly or entirely heterologous, i.e.. foreign, with respect 
to the animal or cell into which it is introduced. Alternatively, the transgene can be 
homologous to an endogenous gene of the transgenic animal or cell into which rt is 
introduced, but is designed to be inserted, or is inserted, into the animal's genome in such 
way as to alter the genome of the cell into which it is inserted (e.g.. -t is inserted at a 
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location which differs from that of the natural gene). A transgene can also be present in an 
episome. A transgene can include one or more expression control elements and any other 
nucleic acid, (e.g. intron), that may be necessary or desirable for optimal expression of a 
selected coding sequence. 

The term "vector" refers to a nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of vector is an episome, i.e.. a nucleic 
acid capable of extra-chromosomal replication. Often vectors are used which are capable 
of autonomous replication and/or expression of nucleic acids to which they are linked. 
Vectors capable of directing the expression of an included gene operatively linked to an 
expression control sequence can be referred to as "expression vectors". Expression 
vectors are typically in the form of "plasmids" which refer generally to circular double 
stranded DNA loops which, in their vector form are not bound to the chromosome. In the 
present specification, "plasmid" and "vector" are used interchangeably as the plasmid is 
the most commonly used form of vector. However, the invention is intended to include such 
other forms of vectors which serve equivalent functions and which are or become known in 
the art. Viral vectors are nucleic acid molecules containing viral sequences which can be 
packaged into viral particles. 
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Bundling domains 

As described above, bundling domains interact with like domains via protein-protein 
interactions to induce formation of protein "bundles". Various order oligomers (dimers, 
trimers, tertramers, etc.) of proteins containing a bundling domain can be formed, depending 
on the choice of bundling domain. 

One example of a dimerization domain is the leucine zipper (LZ) element. Leucine 
zippers have been identified, generally, as stretches of about 35 amino acids containing 4- 
5 leucine residues separated from each other by six amino acids (Maniatis and Abel (1989) 
Nature 341 -.24-25). Exemplary leucine zippers occur in a variety of eukaryotic DNA 
binding proteins, such as GCN4. C/EBP. c-Fos, c-Jun, c-ty/lyc and c-Max. Other 
dimerization domains include helix-loop-helix domains (Murre, C. el al. (1989) Cell 58:537- 
544). Dimerization domains may also be selected from other proteins, such as the retinoic 
acid receptor, the thyroid hormone receptor or other nuclear honnone receptors (Kurokawa 
et al. (1993) Genes Dev. 7:1423-1435) or from the yeast transcription factors GAL4 and 
HAP1 (Marmonstein et al. (1992) Nature 356:408-414; Zhang et al. (1993) Proc. Natl. Acad. 
Sci. USA 90.2851 -2855). Dimerization domains are further described in U.S. Patent No. 
5,624,818 by Eisenman. 



17 



wo 99/10510 



PCT/US98/I77:3 



Of particular current interest are tetramer-forming bundling domains. Incorporation of 
such a tetramerization donnain within a fusion protein leads to the constitutive assembly of 
tetrameric clusters OT"bundles. For example, a bundle of four activation units can be 
assembled by covalently linking the activation unit to a tetramerization domain. By 

5 clustering the activation units together through a bundling domain, four activation units can 
be delivered to a single DNA binding domain at the promoter. The E. coli lactose repressor 
tetramerization domain (amino acids 46-360; Chakenan et al. (1991) J. Biol. Chem. 
266:1371; Alberti et al. (1993) EMBO J. 12:3227; and Lewis et al. (1996) Nature 
271:1247), illustrates this class. Furthermore, since the fusion proteins may contain more 

10 than one activation unit linked to the bundling domain, each of the four proteins of the 
tetramer can contain more than one activation unit (and the complex may comprise more 
than 4 activation units). 

Other illustrative tetramerization domains Include those derived from residues 322- 
355 of p53 (Wang et a!. (1 994) Mol. Ce!!. Biol. 1 4:51 82; Clore et al. (1 994) Science 

15 265:386) see also U.S. Pat. No. 5,573,925 by Halazonetis. Other bundling domains can 
be derived from the Dimerization cofactor of hepatocyte nuclear factor-1 (DCoH), DCoH 
associates with specific DNA binding proteins and also catalyses the dehydration of the 
biopterin cofactor of phenylalanine hydroxylase. DCoH is a tetramer. See e.g. Endrizzi. 
J.A.. Gronk. J.D., Wang, W., Crabtree, G.R and Aiber. T, (1 995) Science 268, 556559; 

20 Suck and Rcner (1 996) FEBS Lett 389(1 ):3-39; Standmann, Senkel and Ryffel (1 998) Int J 

Dev Biol 42(1):53-59 

The bundling domain may comprise a naturally-occurring peptide sequence or a 
modified or artificial peptide sequence. Sequence modifications in the bundling domain may 
be used to Increase the stability of bundle formation or to help avoid unintended bundling 
25 with native protein molecules in the engineered cells which contain a wild-type bundling 
domain. 

For example, sequence substitutions that stabilize ollgomerization driven by leucine 
zippers are known (Krylov et al. (1994) cited above; O'Shea et al. (1992) cited above). 
To illustrate, residues 174 or 175 of human p53 may be replaced by glutamine or leucine, 
30 respectively. 

To illustrate sequence modifications aimed at avoiding unintended bundling with 
endogenous protein molecules, the p53 tetramerization domain may be modified to reduce 
the likelihood of bundling with endogenous p53 proteins that have a wild-type p53 
tetramerization domain, such as wild-type p53 or tumor-derived p53 mutants. Such 

35 altered p53 tetramerization domains are described in U.S. Pat. No. 5,573,925 by 

Halazonetis and are characterized by disruption of the native p53 tetramerization domain 
and insertion of a heterologous bundling domain in a way that preserves tetramerization. 
Disruption of the p53 tetramerization domain involving residues 335-348, or a subset of 
these residues, sufficiently dismpts the function of this domain so that it can no longer drive 

40 tetramerization with wild-type p53 or tumor-derived p53 mutants. At the same time, 
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however, introduction of a heterologous dimerization domain reestablishes the ability to 
tonm tetramers, which is mediated both by the heterologous dimerization domain and by the 
residua! portion of the p53 tetramerization domain sequence. 

Other suitable bundling domains can be readily selected or designed by the 

5 practitioner, including semi-artificial bundling domains, such as variants of the GCN4 leucine 
zipper that form tetramers (Alberti et al. (1 993) EfvlBO J. 12:3227-3236; Harbury et al. 
(1993) Science 262:1401-1407; Krylov etal.(1994) (1994) EMBO J. 1 3:2849-2861). The 
tetrameric variant of GCN4 leucine zipper described in Harbury et aL (1 993), supra, has 
isoleucines at positions d of the coiled coil and leucines at positions a, in contrast to the 

10 original zipper which has leucines and valines, respectively. 

The choice of bundling domain can be based, at least in part, on the desired 
conformation of the bundles. For instance, the GCN4 leucine zipper drives parallel subunit 
assembly [Harbury et al. (1993), cited above], while the native p53 tetramerization 
domain drives antiparallel assembly [Clore et al. (1994) cited above; Sakamoto et al. 

15 (1994) Proc. Natl. Acad. Sci. USA 91:8974-8978]. 

In addition, a variety of techniques are available for identifying other naturally 
occum'ng bundling domains, as well as for selecting bundling domains derived from mutant 
or otherwise artificial sequences. See, for example, Zeng et al. (1 997) Gene 1 85:245; 
O'Shea et al. (1 992) Ge!l 68:699-708; Krylov et al. [cited above]. 

20 In applications of the invention involving the genetic engineering of cells within (or 

for use within) whole animals, the use of peptide sequence derived from that species is 
preferred when possible. For instance, for applications involving human gene therapy, use 
of bundling donnains derived from human proteins may minimize the risk of immunogenic 
reactions. However, in some cases the use of bundling domains of human origin may 

25 induce interactions between the fusion proteins and the endogenous protein from which the 
bundling domain was derived, i.e., leading to unwanted bundling of fusion proteins with the 
endogenous protein containing the identical bundling domain. Such interactions, in addition 
to inhibiting target gene expression, may also have other adverse effects in the cell, e.g., 
by interfering with the function of the endogenous protein from which the bundling domain 

30 was derived. 

Approaches for avoiding unwanted bundling of fusion proteins of this invention 
with endogenous proteins include using a bundling domain which is (a) heterologous to the 
host organism, (b) expressed by the host organism but only (or predominantly) in cells or 
tissues other than those which will express the fusion proteins, or (c) engineered through 
35 modification in peptide sequence such that it bundles preferentially with itself rather than 
with an endogenous bundling domain. 

The first approach is illustrated by the use of a bacterial lac repressor 
tetramerization domain in hunnan cells. 

The second approach requires the use of a bundling domain derived from a protein 
40 which is not expressed in the cells or tissues which are to be engineered to express the 
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fusion protein(s) of this invention, at least not at a level which would cause undue 
interference with the bundling application or with normal ceil function. Fusion proteins 
containing a bundling domain derived from an endogenous protein expressed selectively or 
preferentially in one tissue could be expressed in a different tissue vyithout any adverse 

5 effects. For example, to regulate gene expression in human muscle, fusion proteins 

containing bundling domains from a protein expressed in liver, brain or some other tissue or 
tissues — but not in muscle — can be expressed in muscle ceils without undue risk of 
mismatched bundling. 

In the third approach, and as noted previously, the binding specificity of the 

10 bundling domain is engineered by alterations in peptide sequence to replace (in whole or 
part) bundling activity for proteins containing the wild-type bundling domain with bundling 
activity for proteins containing the modified peptide sequence. 

Several examples of tissue-specific bundling domains which could be used in the 
practice of this invention include bundling domains derived from the Retinoid X receptor, 

15 (Kersten, S., Reczek, P.R and N. Noy (1997) J. Biol. Chem. 272, 29759-29768); 

Dopamine D3 receptor (Nimchinsky, EA, Hof, P.R., Janssen, W.G.M., Morrison, J.H and 
C. Schmauss (1997) J. BioL Chem. 272. 29229-29237); Butyrylcholinesterase (Blong. 
R.M., Bedows, E and 0. Lockridge (1997) Biochem. J. 327, 747-757); Tyrosine 
Hydroxytase (Goodwill, K.E., Sabatier, C. Marks, C, Raag, R., Fitzpatrick, P.F and R.C. 

20 Stevens (1997) Nat. Struct. Biol 7. 578-585). Bcr (McWhirter, J.R., Galasso, D.L and J.Y. 
Wang (1993) MoL Cell. Biol. 13, 7587-7595); and Apolipoprotein E (Westerlund, J.A and 
K.H. Weisgraber (1993) J. Biol. Chem. 268, 15745-15750). 

Transcription Activation Domains / Activation Units 

25 Transcription activation domains and activation units can comprise naturally- 

occurring or non-naturaliyoccurring peptide sequence so long as they are capable of 
activating or potentiating transcription of a target gene construct. A variety of polypeptides 
and polypeptide sequences which can activate or potentiate transcription in eukaryotic 
celis are known and in many cases have been shown to retain their activation function 

30 when expressed as a component of a fusion protein. An activation unit is generally at 

least 6 amino acids, and preferably contains no more than about 300 amino acid residues, 
more preferably less than 200. or even less than 100 residues. 

Naturally occurring activation units include portions of transcription factors, such as 
a thirty amino acid sequence from the C-terminus of VP16 (amino acids 461-490), referred 

35 to herein as "Vc". Other activation units are derived from naturally occurring peptides. For 
example, the replacement of one amino acid of a naturally occuning activation unit by 
another may further increase activation. An example of such an activation unit is a 
derivative of an eight amino acid peptide of VP16, the derivative having the amino acid 
sequence DFDLDMLG. Other activation units are "synthetic" or "artificial" in that they are 
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not derived from a naturally occurring sequence. It is known, for example, that certain 
random alignments of acidic amino acids are capable of activating transcription. 

Certain transcription factors are known to be active only in specific cell types, i.e., 
Ihey activate transcnption in a tissue specific manner. By using activation units which 
function selectively or preferentially in specific ceils, it is possible to design a transcriptional 
activator of the invention having a desired tissue specificity. 

One source of peptide sequence for use in a fusion protein of this invention is the 
herpes simplex virus virion protein 1 6 (referred to herein as VP1 6, the amino acid sequence 
of which is disclosed in Triezenberg, S J. et al. (1 988) Genes Dev. 2:71 8-729). For 
example, an activation unit corresponding to about 127 of the C-terminal amino acids of 
VP16 can be used. Alternatively, at least one copy of about 1 1 amino acids from the C- 
terminal region of VP16 which retains transcription activation ability is used as an activation 
unit. Preferably, an oligomer comprising two or more copies of this sequence is used. 
Suitable C-terminal peptide portions of VP1 6 include those described in Seipel, K. et al. 
(EMBO J. (1992) 13:4961-4968). 

Another example of an acidic activation unit is provided in residues 753-881 of 

GAL4. 

One particularly important source of transcnption activation units is the (human) NF- 
kB subunit p65. The activation domain may contain one or more copies of a peptide 
sequence comprising all or part of the p65 sequence spanning residues 450-550. or a 
peptide sequence derived therefrom. In certain embodiments, it has been found that 
extending the p65 peptide sequence to include sequence spanning p65 residues 361-450, 
e.g., including the "AP activation unif, leads to an unexpected increase in transcnption 
activation. Moreover, a peptide sequence comprising ail or a portion of p65(361 -550), or 
peptide sequence derived therefrom, in combination with heterologous activation units, can 
yield surprising additional increases in the level of transcription activation.. p65-based 
activation domains function across a broad range of promoters and in a number of bundling 
experiments have yielded increases in transcription levels of chromosomally incorporated 
target genes six-fold, eight-fold and even 14-15-foId higher than obtained with unbundled 
tandem copies of VP 16 which itself is widely recognized as a very potent activation 
domain. 

It is expected that recombinant DNA molecules encoding fusion proteins which 
contain a p65 activation unit, or peptide sequence derived therefrom, will provide significant 
advantages for heterologous gene expression in its various contexts, including dimerization 
based regulated systems such as described in International patent applications 
PCT/US94/01617, PCT/US95/10591 . PCT/US96/09948 and the like, as well as in other 
heterologous transcription systems including allostery-based regulation such as those 
involving tetracycline-based regulation reported by Bujard et al. and those involving 
steroid or other hormone-based regulation. 
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Adenovirus can infect virtually all epithelial cells regardless of their cell cycle stage. So far, 
adenoviral infection appears to be linked only to mild disease such as acute respiratory 

disease in the human. 

Adenovirus is particularly suitable for use as a gene transfer vector because of its 
mid-sized genome, ease of manipulation, high titer, wide target-cell range, and high 
inloctivity. Both ends of the viral genome contain 100-200 base pair (bp) inverted terminal 
repeats (ITR). which are cis elements necessary for viral DNA replication and packaging. 
The early (E) and late (L) regions of the genome contain different transcription domains that 
are divided by the onset of viral DNA replication. The E1 region (El A and E1 B) encodes 
proteins responsible for the regulation of transcription of the viral genome and a few cellular 
genes. The expression of the E2 region (E2A and E2B) results in the synthesis of the 
proteins for viral DNA replication. These proteins are involved in DNA replication, late gene 
expression, and host cell shut off (Renan (1990) Radiotherap. Oncol. 19:197). The 
products of the late genes, including the majority of the viral capsid proteins, are expressed 
only after significant processing of a single primaiy transcript issued by the major late 
promoter (MLP). The MLP (located at 16.8 m.u.) is particularly efficient during the late 
phase of infection, and all the mRNAs issued from this promoter possess a 5' tripartite 
leader (TL) sequence which makes them preferred mRNAs for translation. 

The genome of an adenovirus can be manipulated such that it encodes a gene 
product of interest, but is inactivated in terms of its ability to replicate in a normal lytic viral 
life cycle (see, for example, Berkner et al.. (1 988) BioTechniques 6:61 6; Rosenfeld et al.. 
(1991) Science 252:431-434; and Rosenfeld et al., (1992) Cell 68:143-155). Suitable 
adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other strains of 
adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled In the art. 
Recombinant adenoviaises can be advantageous in certain circumstances in that they are 
not capable of infecting nondividing cells and can be used to infect a wide variety of cell 
types, including aiway epithelium (Rosenfeld et al., (1992) cited supra), endothelial cells 
(Lemarchand et al., (1992) PNAS USA 89:6482-6486), hepatocytes (Herz and Gerard, 
(1 993) PNAS USA 90:281 2-281 6) and muscle ceils (Quantin et al., (1 992) PNAS USA 
89:2581-2584). Adenovirus vectors have also been used in vaccine development 
(Grunhaus and Horwitz (1992) Seminar in Virology 3:237; Graham and Prevec (1992) 
Biotechnology 20:363). Experiments in administering recombinant adenovirus to different 
tissues include trachea instillation (Rosenfeld et al. (1991) ; Rosenfeld et al. (1992) Cell 
68:143), muscle injection (Ragot et al. (1993) Nature 361:647), peripheral intravenous 
35 injection (Her2 and Gerard (1993) Proc. Natl. Acad. Sci. U.S.A. 90:2812), and stereotactic 
inoculation into the brain (Le Gal La Salle et al. (1 993) Science 254:988). 

Furthermore, the vims particle is relatively stable and amenable to purification and 
concentration, and as above, can be modified so as to affect the spectrum of infectivity. 
Additionally, adenovirus is easy to grow and manipulate and exhibits broad host range in 
vitro and in vivo. This group of viruses can be obtained in high titers, e.g., 109 - 101 1 
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plaque-forming unit (PFU)/ml, and they are highly infective. The life cycle of adenovirus 
does not require integration into the host cell genome. The foreign genes delivered by 
adenovirus vectors are episomal, and therefore, have low genotoxicity to host cells. No 
side effects have been reported in studies of vaccination with vviid-^ype adenovirus 
5 (Couch et al., 1 963; Top et al., 1 971 ), demonstrating their safety and therapeutic potential 
as in vivo gene transfer vectors. Moreover, the carrying capacity of the adenoviral 
genome for foreign DNA is large (up to 8 kilobases) relative to other gene delivery vectors 
(Berkner et al., supra; Haj-Ahmand and Graham (1986) J. Virol. 57:267). Most replication- 
defective adenoviral vectors currently in use and therefore favored by the present 

10 invention are deleted for all or parts of the viral E1 and E3 genes but retain as much as 

80% of the adenoviral genetic materia! (see, e.g., Jones et a!., (1979) Cell 16:683; Berkner 
et al., supra; and Graham et a!., in Methods in Molecular Biology, E.J. Murray, Ed. 
(Humana, Clifton, NJ, 1991) vol. 7. pp. 109-127). Expression of the inserted gene can be 
under control of, for example, the E1 A promoter, the major late promoter (MLP) and 

15 associated leader sequences, the viral E3 promoter, or exogenously added promoter 
sequences. 

Other than the requirement that the adenovirus vector be replication defective, or 
at least conditionally defective, the nature of the adenovirus vector is not believed to be 
crucial to the successful practice of the invention. The adenovirus may be of any of the 42 

20 different known serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the 
preferred starting material in order to obtain the conditional replication-defective adenovims 
vector for use in the method of the present invention. This is because Adenovirus type 5 
is a human adenovirus about which a great deal of biochemical and genetic information is 
known, and it has historically been used for most constructions employing adenovirus as 

25 a vector. As stated above, the typical vector according to the present Invention is 
replication defective and will not have an adenovirus El region. Thus, it vAW be most 
convenient to introduce the nucleic acid of interest at the position from which the E1 coding 
sequences have been removed. However, the position of insertion of the nucleic acid of 
interest in a region within the adenovinjs sequences is not critical to the present invention. 

30 For example, the nucleic acid of interest may also be inserted in lieu of the deleted E3 

region in E3 replacement vectors as described previously by Karlsson et. al. (1986) or in 
the E4 region where a helper cell line or helper virus complements the E4 defect. 

A preferred helper cell line is 293 (ATCC Accession No. CRL1573). This helper 
cell line, also temned a "packaging cell line" was developed by Frank Graham (Graham et 

35 al. (1 987) J. Gen. Virol. 36:59-72 and Graham (1 977) J.General Virology 68:937-940) and 
provides El A and E1B in trans. However, helper cell lines may also be derived from 
human cells such as human embryonic kidney cells, muscle cells, hematopoietic cells or 
other human embryonic mesenchymal or epithelial cells. Alternatively, the helper cells may 
be derived from the cells of other mammalian species that are pemnissive for human 
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adenovirus. Such celis include, e.g.. Vero cells or other monkey embryonic mesenchymal 
or epitheliai cells. 

Various adenovirus vectors have been shown to be of use in the transfer of genes 
to mammals, including humans. Replication-deficient adenovirus vectors have been used 
to express marker proteins and CFTR in the pulmonary epithelium. Because of their ability 
to efficiently infect dividing cells, their tropism for the lung, and the relative ease of 
generation of high titer stocks, adenoviral vectors have been the subject of much research 
in the last few years, and various vectors have been used to deliver genes to the lungs of 
human subjects (Zabner et al.. Cell 75:207-216, 1993; Crystal, et al., Nat Genet. 8:42-51 , 
1994' Boucher, et al.. Hum Gene Ther 5:615-639, 1994). The first generation Ela deleted 
adenovirus vectors have been improved upon with a second generation that includes a 
temperature-sensitive E2a viral protein, designed to express less viral protein and thereby 
make the virally infected cell less of a target for the immune system (Goldman et al., Human 
Gene Therapy 6:839-851 .1 995). More recently, a viral vector deleted of all viral open 
readingframeshasbeenreported(F.sheretal..Virology217:11-22.1995). Moreover.it 
has been shown that expression of viral IL-1 0 inhibits the immune response to adenoviral 
antigen (Qin et a!., Human Gene Therapy 8:1 365-1 374, 1 997). 

Adenoviruses can also be cell type specific, i.e.. infect only restricted types of cells 
and/or express a transgene only in restricted types of cells. For example, the viruses 
comprise a gene under the transcriptional control of a transcription initiation region 
specifically regulated by target host cells, as described e.g., in U.S. Patent No. 5.698,443. 
by Henderson and Schuur, issued December 16, 1997. Thus, replication competent 
adenoviruses can be restricted to certain cells by, e.g.. inserting a cell specific response 
element to regulate a synthesis of a protein necessary for replication, e.g.. El A or E1B. 

DNA sequences of a number of adenovirus types are available from Genbank. For 
example, human adenovirus type 5 has GenBank Accession NO.M73260, The 
adenovirus DNA sequences may be obtained from any of the 42 human adenovirus types 
currently identified. Various adenovirus strains are available from the American Type 
Culture Collection, Rockville. f^aryland. or by request from a number of commercial and 
academic sources. A transgene as described herein may be incorporated into any 
adenoviral vector and delivery protocol, by the same methods (restriction digest, linker 
ligation or filling in of ends, and ligation) used to insert the CFTR or other genes into the 

vectors. , . , [-i 

Adenovirus producer cell lines can include one or more of the adenoviral genes El , 
E2a and E4 DNA sequence, for packaging adenovims vectors in which one or more of 
these genes have been mutated or deleted are described, e.g.. in PCT/US95/15947 (WO 
96/18418) by Kadan et al.; PCT/US95/07341 (WO 95/346571) by Kovesdi et al.; 
PCT/FR94/00624 (W094/28152) by Imler et al.;PCT/FR94/00851 (WO 95/02697) by 
Perrocaudet etal., PCT/US95/14793 (WO96/14061) by Wang et al. 
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AAV Vectors . 

Another viral vector system useful for delivery of DNA is the adeno-assocated 
virus (AAV) Adeno-associated virus is a naturally occurring defective virus that requires 
another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient 
replication and a productive life cycle. (For a review, see Muzyczka et al., Curr. Topics ,n 
Micro^ and Immunol. (1992) 158:97-129). 

AAV has not been associated with the cause of any disease. AAV is not a 
.ransfomiing or oncogenic virus. AAV integration into chromosomes of human cell lines does 
not cause any significant alteration in the growth properties or morphological charactenstics 
ol the cells. These properties of AAV also recommend it as a potentially useful human gene 
therapy vector. 

AAV is also one of the few viruses that may integrate its DNA into non-dividing 
cells e g pulmona^ epithelial cells or muscle cells, and exhibits a high frequency of stable 
integration (see for example Flotte et al.. (1992) Am. J. Respir. Cell. Mol. Biol. 7:349-o5o; 
Samulski et al (1989) J. Virol. 63:3822-3828; and McLaughlin et al., (1989) J. Virol. 
62-1963-1973) Vectors containing as little as 300 base pairs of AAV can be packaged 
and can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector 
such as that described in Tratschin et al., (1 985) Mol. Cell. Biol. 5:3251 -3260 can be used 
to introduce DNA into cells. A variety of nucleic acids have been introduced into different 
cell types using AAV vectors (see for example Hermonat et al., (1984) PNAS USA 
81 -6466-6470; Tratschin et al., (1 985) Mol. Cell. Biol. 4:2072-2081 ; Wondisford et al 
(1988) Mol. Endocrinol. 2:32-39; Tratschin et al.. (1984) J. Virol. 51:611-619; and Flotte et 

al (1 993) J. Biol. Chem. 268:3781 -3790). 

The AAV-based expression vector to be used typically includes the 145 nucleotide 
AAV inverted terminal repeats (ITRs) flanking a restriction site that can be used for 
subcloning of the transgene. either directly using the restriction site available, or by 
excision of the transgene with restriction enzymes followed by blunting of the ends ligation 
of appropriate DNA linkers, restriction digestion, and ligation into the site between the ITRs. 
The capacity of AAV vectors is about 4.4 kb. The following proteins have been 
expressed using various AAV-based vectors, and a variety of promoter/enhancers: 
neomycin phosphotransferase, chloramphenicol acetyl transferase, Fanconi's anemia gene, 
cystic fibrosis transmembrane conductance regulator, and granulocyte macrophage colony- 
stimulating factor (Kotin, R.M.. Human Gene Therapy 5:793-801 . 1994, Table 1). A 
transgene incorporating the various DNA constructs of this invention can similarly be 
included in an AAV-based vector. As an alternative to inclusion of a constitutive promoter 
such as CMV to drive expression of the recombinant DNA encoding the fusion protein(s). 
e.g. fusion proteins comprising an activation domain or DNA-binding domain, an AAV 
promoter can be used (ITR itself or AAV p5 (Flotte, et al. J. Biol.Chem. 268:3781-3790, 
1993)). 
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such a vector can be packaged into AAV virions by -P^^^^ -^'^J^^^^ 
example a hur^an cell line such as 293 c^n be co-transfected with the ^AV-based 

to nig py ^, _ ^^^^^^ December 2. 

replication o, Epslein Ba,r.i.s (oriP) and an AAV genome. ^ 

wherein the p5 promoter, which normally controls rep expression, - -P^^^ \ 
,=rni^ Patpnt5658 776.byFlotteetal.,issuedAug. 19, iyy/J- 

, Virol. 63-.3822. Concentration and purification of the virvjs can be ach.eved Dy p 
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methods such as banding in cesium chloride gradients, as was used for the initial report of 
AAV vector expression in vivo (Flotte, et ai. J. Biol. Chem. 263:3781-3790, 1993) or 
chromatographic purification, as described in O'Riordan et ai., WO97/08298. 

Methods for in vitro packaging A.AV vectors are also avaiiable and have the 
advantage that there is no size limitation of the DNA packaged into the particles (see, U.S. 
Patent No. 5,688,676, by Zhou et a!., issued Nov. 18. 1997). This procedure involves the 
preparation of cell free packaging extracts. 

For additional detailed guidance on AAV technology which may be useful in the 
practice of the subject invention, including methods and materials for the incorporation of a 
transgene, the propagation and purification of the recombinant AAV vector containing the 
transgene, and its use in transfecting cells and mammals, see e.g. Carter et a!, US Patent 
No, 4.797.368 (10 Jan 1989); Muzyczka et al, US Patent No. 5,139,941 (18 Aug 1992); 
Lebkowski et al, US Patent No. 5.1 73.41 4 (22 Dec 1 992); Srivastava, US Patent No. 
5.252,479 (12 Oct 1993); Lebkowski et al, US Patent No. 5.354,678 (11 Oct 1994); Shenk 
et at. US Patent No. 5,436,146(25 July 1995); Chatterjee et al. US Patent No. 5,454,935 
(12 Dec 1995), Carter et al WO 93/24641 (published 9 Dec 1993), and Natsoulis. U.S. 
Patent No. 5,622,856 (April 22, 1997). Further information regarding AAVs and the 
adenovirus or herpes helper functions required can be found in the following articles.Berns 
and Bohensky (1987), "Adeno-Associated Viruses: An Update^ Advanced in Virus 
Research, Academic 

Press, 33:243-306. The genome of AAV is described in Laughlin et al. (1983) "Cloning of 
infectious adeno-associated virus genomes in bacterial ptasmids", Gene, 23: 65-73. 
Expression of AAV is described in Beaton et al. (1989) "Expression from the 
Adeno-associated virus p5 and pi 9 promoters is negatively regulated in trans by the rep 
protein; J. Virol., 63:4450-4454. Constoiction of rAAV is described in a number of 
publications: Tratschin et al. (1 984) "Adeno-associated virus vector for high frequency 
integration, expression and rescue of genes in mammalian cells", Mol. CelL Biol., 
4:2072-2081; Hermonat and Muzyczka (1984) "Use of adeno-associated virus as a 
mammalian DNA cloning vector Transduction of neomycin resistance into mammalian tissue 
culture cells-, Proc. Natl. Acad. Sci. USA, 81:6466-6470; McLaughlin etal. (1988) 
-Adeno-associated virus general transduction vectors: Analysis of Proviral Structures". J. 
Virol., 62:1963-1973; and Samulski et al. (1989) "Helper-free stocks of recombinant 
adeno-associated viruses: normal integration dOBsqnate viral gene expression", J. Virol.. 
63:3822-3828. Ceil lines that can be transformed by rAAV are those described in 
Lebkowski et al. (1 988) -Adeno-associated virus: a vector system for efficient introduction 
and integration of DNA into a variety of mammalian cell types", Mol. Cell. BioL, 
8:3988-3996. "Producer" or "packaging" cell lines used in manufacturing recombinant 
retroviruses are described in Dougherty et al. (1989) J. Virol., 63:3209-3212; and 
Markowitz et al. (1 988) J. Virol., 62:1 1 20-1 124. 
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Hybrid Adenovirus-AAV Vectors 

Hybrid Adenovirus-AAV vectors represented by an adenovirus capsid containing a 
nucleic acid comprising a portion of an adenovirus, and 5' and 3' ITR sequences Irom an 
AAV which flank a selected transgene under the control of a promoter. See e.g. Wilson et 
al Intemational Patent Application Publication No. WO 96/13598. This hybnd vector is 
characterized by high titer transgene delivery to a host cell and the ability to stably 
integrate the transgene into the host cell chromosome in the presence of the rep gene. This 
virus is capable of infecting virtually all cell types (conferred by its adenovirus sequences) 
and stable long term transgene integration into the host cell genome (conferred by its AAV 
sequences). 

The adenovirus nucleic acid sequences employed in the this vector can range from 
a minimum sequence amount, which requires the use of a helper virus to produce the 
hybrid virus particle, to only selected deletions of adenovirus genes, which deleted gene 
products can be suppiied in the hybrid viral process by a packaging ceil. For example, a 
hybrid virus can comprise the 5' and 3' inverted terminal repeat (ITR) sequences of an 
adenovirus (which function as origins of replication). The left terminal sequence (5') 
sequence of the Ad5 genome that can be used spans bp 1 to about 360 of the 
conventional adenovirus genome (also referred to as map units 0-1 ) and includes the S 
ITR and the packaging/enhancer domain. The 3' adenovirus sequences of the hybrid virus 
include the right terminal 3' ITR sequence which is about 580 nucleotides (about bp 
35 353- end of the adenovirus, referred to as about map units 98.4-1 00. 

The AAV sequences useful in the hybrid vector are viral sequences from which the 
rep and cap polypeptide encoding sequences are deleted and are usually the cis acting 5' 
and 3- ITR sequences. Thus, the AAV ITR sequences are flanked by the selected 
adenovirus sequences and the AAV ITR sequences themselves flank a selected 
transgene The preparation of the hybrid vector is further described in detail in published 
PCT application entitled "Hybrid Adenovirus-AAV Virus and Method of Use Thereof, WO 
96/13598 by Wilson et al. 

For additional detailed guidance on adenoviais and hybrid adenovims-AAV 
technology which may be useful in the practice of the subject invention, including methods 
and materials for the incorporation of a transgene, the propagation and purification of 
recombinant virus containing the transgene, and its use in transfecting cells and mammals, 
see also Wilson et al, WO 94/28938, WO 96/13597 and WO 96/26285, and references 
cited therein. 



Retroviruses 

The retroviruses are a group of single-stranded RNA viruses characterized by an 
ability to convert their RNA to double-stranded DNA in infected cells by a process of 
reverse-transcription (Coffin (1990) Retroviridae and their Replication" In Fields, Knipe ed. 
Virology New York: Raven Press). The resulting DNA then stably integrates into cellular 
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Chromosomes as a prowus and d.rects synthes,s of v,ra. prote.ns^ The ,ntegrat.on resuKs 
,n the retent,on of .he v,ra, gene sequences in the rec.p,ent cell and ,.s descendanfs The 
retroviral oenome contains tnree genes, gag, pol, and env tha, code for caps.dal prote.ns, 
polymera;e enzvme, and envelope components, respectively, A sequence ,ound 
pieam from the gag gene, termed ps, , functions as a s.gna, for P-^^^-g o e 9 me 
L vir,ons. Two long term.nal repeat (LTR) sequences are present at the 5 and o ends 
of the viral genome. These contain strong promoter and enhancer sequences and are also 
required for integration in the host cell genome (Coff.n (1 990), supra). 

In order to construct a retroviral vector, a nucleic acid of interest is inserted into the 
v,ral genome ,n the place of certain viral sequences to produce a virus that ,s 
replication-defective. In order to produce vinons, a packaging cell line containing the gag, 
o' and env genes but without the LTR and psi components ,s constructed (Mann et al. 
(1983) Cell 33-153) When a recombinant plasmid containing a human cDNA, together with 
^^^.^ psi sequences is introduced into this cell „ne (by ca.ium phospha e 
precipitation for example), the psi sequence allows the RNA transcnpt of the recombinant 
Lsl to be packaged into viral particles, which are then secreted into the culture media 
(Nicolas and Rubenstein (1988) "Retroviral Vectors". In: Rodriguez ^enhardt ed 
vectors- A Survey of Molecular Cloning Vectors and their Uses. Stoneham:Butten.orth. 
Temin, (1986) "Retrovirus Vectors for Gene Transfer: Efficient integration into and 
Sprein of Exogenous DNA in Vertebrate Cell Genome", In: Kucher apat, ed. Gene 
Transfer New York: Plenum Press; Mann et al., 1 983, supra). The media containing the 
recombinant retroviruses is then collected, optionally concentrated, and used for gene 
transfer. Retroviral vectors are able to infect a broad variety of ^P-^^°^^;"' 
integration and stable expression require the division of host cells (Paskind et al. (1975) 

'■'"'^'aITc'p^ 

particularly with regard to the possibility of the spread of v.ild-type vi-s in the ce^ 
population The development of specialized cell lines (termed "packaging e lls ) wh ch 
reduce only replication-defective retroviruses has increased the utility of 
gene therapy, and defective retroviruses are well characterized for use in gene ran^fer for 
qene therapy purposes (for a review see Miller, A.D. (1 990) Blood 76:271). Thus. 
ecolLnt retrovirus cln be constructed in which part of the retroviral -d-ng sequen^ 
(aaq pol env) has been replaced by nucleic acid encoding a fusion protein of the present 
!n:eTn, iLring the retrovirus replication defective. The replication defective retrov us 
is then packaged into virions which can be used to infect a target cell through the use of a 
hCrusbystandardtechniques. Protocols for producing recombinant reUoviruses and 
for nfecting cells :n ..o or in vivo with such viruses can be found in Current Protocols ,n 
Molecular Biology, Ausubel, P.M. et al.. (eds.) Greene Publishing Assoc ates (1 989), 
Sections 9 10-9.14 and other standard laboratory manuals. Examples of su.tab e 
Sroviruses include pU, pZIP. pWE and pEM which are we., known to those skilled ,n the 
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One class of p65-based transcription factors contain more than one copy of a p65- 
denved doma.n. Such proteins will t^-pically contain or more, generally up to about s,x, 
copies of a peptide sequence comprising all or a portion of p65^361 -550) or pep. e 
so^uertce derived therefrom. Such iterated p65-based transcription activation domains are 
useful both in bundled and non-bundled approaches. be 

Oth-r polypeptides with transcription activation activity in eukaryctic cells can 
used to provide activation units for the fusion proteins of this invention, transcription 
activation domains found within vanous proteins have been grouped into categories based 
upon shared structural features. Types of transcription activation domains include acidic 
transcription activation domains (noted previously), proline-rich transcription activation 
domains senne/threonine-rich transcnption activation domains and glutamine-nch 
tZ:;L activation domains. Examples o, pro„ne-noh activation domains include ammo 
acid residues 399-499 of CTF/NF1 and ammo acid residues 31-76 of AP2. Examp^s of 
senne/threonine-rich transcnption activation domains include amino acid residues 1-427 
,TF1 and amino acid residues 2-451 of 1TF2. Examples of glutamine-nch activation 
domains include ammo acid residues 175-269 of Oct1 and amino acid residues 1o2-243 
Sp1 . The amino acid sequences of each of the above ^'^'^''^^l^'f^^^^^ 
useful transcription activation domains, are disclosed in Seipel, K. et al. (EMBO J. (1992) 

13:4961^4968)^^ illustrative activation domains and motifs of human origin include the 
activation domain of human CTF. the 1 8 amino acid (^^FLQLPQQTQGALLTSQP) 
glutamine rich region of Oct-2. the N-termina. 72 amino acids of p53. the SYGQQS ep^^^^ 
I Ewing sarcoma gene and an 1 1 amino add (535-545) acidic noh region of Re A protein. 

in addition to previously described transcription activation domains, novel 
transcrip^on activation units, which can be identified by standard techniques, are withm 
scope of the invention. The transcription activation ability of a polypept.de can be 
assayed by linking the polypeptide to a DMA binding domainand ^^^^^^^^^^^ '^^^^^ 
of transcription of a target sequence that is stimulated by the fusion protein. For example, 
a standard assay used in the art utilizes a fusion protein of a putative act.va on un d a 
gIu DNA bindfng domain (e.g.. amino acid residues 1 -93). This fusion protein ,s then 
used to stimulate expression of a reporter gene linked to GAL4 binding sites (see e.g.. 
Seipel K etal (1992) EMBO J. 11:4961 -4968 and references cited therein). 

' The activation domains of the invention can be from any eukaryotic species 
(including but not limited to various yeast species and various vertebrate species. 
Including the mammals), and it is not necessary that eve^ activation unit or domam be from 
the same species. In applications of this invention to whole organisms, .t is often 
preferable to use activation units and activation domains from the same species as 
recipient to avoid immune reactions against the fusion proteins. 
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Mutations in the Activation Domain 

One way to increase the potency of an activalion domain is to increase its acidic or 
hydrophobic content through modifications in peptide sequence. Acidic amino acids which 
can increase potency of activation domains include aspartic acid and glutamic acid. In some 
cases, one may want to decrease (usually only modestly) the potency of the activation 
domain in order to obtain a less steep activation curve, especially if a greater number of 
individually weaker activation domains will be deployed together, e.g., by bundling. 

Thus, in one embodiment of this invention, mutations are introduced into the 
activation domain by standard techniques known in the art. such as site-directed PGR 
based mutagenesis. In this embodiment, one to five, in some cases one to three, 
alterations in peptide sequence can be introduced into the DNA coding for the activation 
domain. Each of these mutations either alone or in combination with one or more other 
mutations may be readily assayed for its ability to induce the transcription of either 
transiently transfected or stably integrated target reporter gene constructs. For instance, a 
construct encoding a fusion protein containing multiple copies of the modified sequence and 
a DNA binding domain can be introduced into cells and the activity of the encoded fusion 
protein measured in transcription assays (with a responsive reporter gene construct) and 
compared to analogous fusion proteins containing wild-type activation domain sequence or 

a different mutation of interest. 

The foregoing is illustrated in the case of the p65 transcription activation domains. 
Constructs are prepared encoding fusion proteins containing one or more p65 transcription 
activation domains and a DNA binding domain. The p65 domains may be wild-type (as a 
control) or may contain any of a variety of alterations in peptide sequence. These 
mutations can generally be introduced into a variety of p65-derived transcription activation 
domains For example, mutations can be introduced Into plasmids carrying p65 
activation domain coding regions between amino acids 533 and 550, or 361 and 550, or 
280 and 550. 

Exemplary mutations for p65 transcription activation domains include those 
intended to increase the potency of the p65 activation domain (including the , U2. 
M6,IV17 and M8 mutations) and those intended to decrease the potency (generally slightly) 
of the activation domain. The p65 activation domain contains four phenylalanine residues 
and mutations that convert these residues to alanine has been shown to significantly 
reduce the potency of the p65 activation domain in yeast and in vitro experiments. Our 
experiments show that changing F 533 and F 541 to alanine residues reduced the potency 
of p65 activation domain to half of wild type level. IVlutations of the t^4 and M5 class 
change the conserved serine and proline residues between amino-acids 361 and 450. Our 
data show that M4 and MS mutant sequences can induce the expression of target genes 
synergistically when fused to other acidic type activation domains. In GST pull down 
assays the region of the 1^4 and Mb mutations interacts with TFIIA. Although MA and M5 
mutations individually have a very small effect on the ability of p65 activation domain to 
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induce the reporter gene, corribined together, they significantly reduce its potency. Thu 
mutations for the practitioner to bear in mind include, but are not limited to, the following: 

WT: 532-DFSSIADMDFSALLSQIS 

M1: 532-DFSDFADMDFDADLSQIS 

WT: 439-ALLQLQFDDED 

M2: 439-ALLDLDFDDED 

WT: 529-GDEDFSSIADMDFSALLSQI 

M3: 529-GDEDASSIADMDASALLSQI 

WT: 377-SALALPAPPQVL 

M4: 377-GALALGAGGQVL 

WT: 401-SALAQAPAPVP 

M5: 401-GAU\QAGAGVG 

WT: 434-GTLSEALLQLQFD 

M6: 434-GDFS-ALLQLQFD 

WT: 472-SEFQQLLNQ 

M7: 472-SEFSALLNQ 

WT: 472-SEFQQLLNQ 

M8: 472-SDFQQLLNQ 

WT: 530-DEDFSSlADMDFS 

M9: 530-DEDFSSLLDMDFS 



Synergizing Domains 

A synergizing domain is any domain which cbsen/ably increases the potency of 
transcription activation when recruited to the promoter along with the transcription activation 
domain A synergizing domain can be an independent transcription activation domain or an 
activation unit which on its own does not induce transcription but is able to potentiate the 
activity of a transcription activation domain with which it is linked covalently (i.e., within the 
same fusion protein) or with which it is associated non-covalently (e.g.. through bundling or 
ligand-mediated clustering). _^ 

One example of a synergizing domain is the so-called "alanine/proline rich" or AP 
activation motif of p65, which extends from about amino acids 361 to about amino acd 450 
of that protein. Similar AP activation motifs are also present in, e.g., the p53 and CTF 
proteins. The presence of one or several copies of an AP domain alone in a protein does 
not itself provide the ability to induce activator-dependent transcription activation. 
However, when linked to activation units which are themselves capable of inducing some 
level of activator-dependent transcription, e.g., another portion of p65 or VP1 6, the AP 
activation unit synergizes with the second activation domain to induce an increase in the 
level of transcription. 
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Accordingly, the invention provides an AP activation unit, functional derivative 
thereof or other synergizing domain which on its own is incapable ol activating 
transcription. Functional alternative sequences for use as synergizing domains, including 
among others derivatives of an AP activation unit, can be obtained, for instance, by 
screening candidate sequences for binding to TFIIA and measuring transcriptional activity ,n 
a co-transfection assay. Such equivalents are expected to include forms of the activation 
unit which are truncated at either the N-terminus or C-terminus or both, e.g.. fragments of 
p65 (or homologous sequences thereto) which are about 75, 60, 50, 30 or even 20 ammo 
acid residues in length (e.g., ranging in length from 20-89 amino acids). Likewise, it is 
expected that the AP activation unit sequence from p65 can tolerate ammo acid 
substitutions, e.g., to produce AP motifs of at least 95%, 90%, 80% and even 70% 
identity with the AP activation unit sequence of SEQ ID No. 2 of USSN 08/918,401. 
These and other AP derivatives include, for example, AP domains based on naturally- 
occurring sequence but modified by the replacement, insertion or deletion of 1 . 2. 3, 4 or 5 
amino acid residues. 

Other synergizing domains are independent activation domams. e.g. VP16. While 
VP16 can activate transcription on its own. it can synergize with p65 to produce levels of 
transcription that are greater than the sum of the transcription levels effected by each 
activation domain alone. As shown in the examples, fusion of VP1 6 to a nucleic acid 
containing an FRB domain, a lac repressor tetramerization domain and p65 greatly 
increases the level of expression of a target gene as compared to the same construct .n the 

absence of VP1 6. .r^.,,^-.- 

Synergizing domains may also be fused to an unbundled or bundled DMA binding 
domain To avoid the activation of transcription in a constitutive manner with constructs 
such as these, it is preferable that the synergizing domain itself be incapable of activating 
transcription. 

Ligand binding domains 

Fusion proteins containing a ligand binding domain for use in practicing this 
invention can function through one of a variety of molecular mechanisms. 

In certain embodiments, the ligand binding domain pemiits ligand-mediated cross- 
linking of the fusion protein molecules bearing appropriate ligand binding domains. In these 
cases the ligand is at least divalent and functions as a dimerizing agent by binding to the 
two fusion proteins and forming a cross-linked heterodimeric complex which activates target 
gene expression. See e.g. WO 94/18317, WO 96/20951, WO 96/06097, WO 97/31898 
and WO 96/41865. 

in other embodiments, the binding of ligand to fusion protein is thought to result in 
an allosteric change in the protein leading to the binding of the fusion protein to a target 
DNA sequence [see e.g. US 5.654,168 and 5.650,298 (tet systems), and WO 93/23431 
and WO 98/18925 (RU486-based systems)] or to another protein which binds to the 



25 



wo 99/10510 



PCT/US9S/I77:3 



target DNA sequence [see e.g. WO 96/37609 and WO 97/381 17 (ecdysone/RXR-based 
systenns)], in either case, modulating target gene expression. 

Dimerization-based systems 
5 In the cross-iinking-based dimerization systems the fusion proteins can contain one 

or more ligand binding domains (in some cases containing tvyo, three or four such domains) 
and can further contain one or more additional domains, heterologous v^'ith respect to the 
ligand binding domain, including e.g. a DNA binding domain, transcription activation domain, 
etc. 

10 In general, any ligand/ligand binding domain pair may be used in such systems. For 

example, ligand binding domains may be denved from an immunophilin such as an FKBP, 
cyclophiiin, FRB domain, hormone receptor protein, antibody, etc., so long as a ligand is 
known or can be identified for the ligand binding domain. 

For the most part, the receptor domains will be at least about 50 amino acids, and 

15 fewer than about 350 amino acids, usually fewer than 200 amino acids, either as the 

natural domain or tnjocated active portion thereof. Preferably the binding domain will be 
small {<25 kDa, to allow efficient transfection in viral vectors), monomeric, nonimmunogenic, 
and should have synthetically accessible, cell permeant, nontoxic ligands as described 
above. 

20 Preferably the ligand binding domain is for (i.e., binds to) a ligand which is not itself 

a gene product (i.e., is not a protein), has a molecular weight of less than about 5 RD and 
preferably less than about 2.5 kD, and is cell permeant. In many cases it will be preferred 
that the ligand does not have an intrinsic pharmacologic activity or toxicity which interferes 
with its use as a transcription regulator. 

25 The DNA sequence encoding the ligand binding domain can be subjected to 

mutagenesis for a variety of reasons. The mutagenized ligand binding domain can provide 
for higher binding affinity, allow for discrimination by a ligand between the mutant and 
naturally occurring forms of the ligand binding domain, provide opportunities to design 
ligand-ligand binding domain pairs, or the like. The change in the ligand binding domain can 

30 involve directed changes in amino acids known to be involved in ligand binding or with 
ligand-dependent conformational changes. Alternatively, one may employ random 
mutagenesis using combinatorial techniques. In either event, the mutant ligand binding 
domain can be expressed in an appropriate prokaryotic or eukarotic host and then 
screened for desired ligand binding or conformational properties. Examples involving FKBP, 

35 cyclophiiin and FRB domains are disclosed in detail in WO 94/1 831 7, WO 96/06097, WO 
97/31898 and WO 96/41865. For instance, one can change Phe36 to Ala and/or Asp37 to 
Gly or Ala in FKBP12 to accommodate a substituent at positions 9 or 10 of the ligand 
FK506 or FK520 or analogs, mimics, dimers or other derivatives thereof. In particular, mutant 
FKBP12 domains which contain Val, Ala, Gly, Met or other small amino acids in place of one 

40 or more of Tyr26, Phe36, Asp37, Tyr82 and Phe99 are of particular interest as receptor 
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domains for FK506-t^'pe and FK-520-type ligands containing nnodifications at C9 and/or 
C1 0 and their synthetic counterparts (see e.g., WO 97/31 898). Illustrative mutations of 
current interest in FKBP domains also include the following: 



F36A 


Y26V 


F46A 


W59A 


F36V 


Y26S 


F48H 


H87W 


F36M 


D37A 


F48L 


H87R 


F36S 


I90A 


F48A 


F36V/F99A 


F99A 


191A 


E54A/F36V/F99G 


F99G 


F46H 


£54K/F36^A^F99A 


Y26A 


F46L 


V55A 


F36M/F99G 







5 

Table 1 : Entries identify the native amino acid by single letter code and sequence position, 
followed by the replacement amino acid in the mutant. Thus, F36V designates a human 
FKBP12 sequence in which phenylalanine at position 36 is replaced by valine. 
F36V/F99A indicates a double mutation in which phenylalanine at positions 36 and 99 are 
10 repiacedby valine and alanine, respectively. 

Illustrative examples of domains which bind to the FKBP:rapamycin complex 
("FRBs") are those which include an approximately 89-amino acid sequence containing 

15 residues 2025-211 3 of human FFIAP. Another FRAP-derived sequence of interest 
comprises a 93 amino acid sequence consisting of amino acids 2024-21 13. Similar 
considerations apply to the generation of mutant FRAP-derived domains which bind 
preferentially to FKBP complexes with rapamycin analogs (rapalogs) containing 
modifications (i.e., are 'bumped') relative to rapamycin in the FRAP-binding portion of the 

20 drug. For example, one may obtain preferential binding using rapalogs bearing 

substituents other than -OMe at the 01 position with FRBs based on the human FRAP 
FRB peptide sequence but bearing amino acid substitutions for one of more of the residues 
Tyr2038, Phe2039, Thr2098, Gln2099, Trp2101 and Asp2102. Exemplary mutations 
include Y2038H, Y2038L, Y2038V, Y2038A, F2039H. F2039L, F2039A, F2039V, D2102A, 

25 T2098A, T2098N, T2098L, and T2098S. Rapalogs bearing substituents other than -OH at 
028 and/or substituents other than =0 at 030 may be used to obtain preferential binding 
to FRAP proteins bearing an amino acid substitution for Glu2032. Exemplary mutations 
include E2032A and E2032S. Proteins comprising an FRB containing one or more amino 
acid replacements at the foregoing positions, libraries of proteins or peptides randomized at 

30 those positions (i.e., containing various substituted amino acids at those residues), 

libraries randomizing the entire protein domain, or combinations of these sets of mutants are 
made using the procedures described above to identify mutant FRAPs that bind 
preferentially to bumped rapalogs. 



27 



wo 99/10510 



PCT;US9S/r72J 



Other macroIicJe binding domains useful in the present invention, including mutants 
thereof, are described in the art. See, tor example, W096/41865, VVOQ6/13613, 
VVO96/06111. WO96/06110, VVO96/06097, W096/12796, WO95/05389, VVO95/02634, 
VV094/18317. 

5 The ability to employ in vitro mutagenesis or combinatorial modifications of 

sequences encoding proteins allows for the production of libraries of proteins which can be 
screened for binding affinity for different ligands. For example, one can randomize a 
sequence of 1 to 5, 5 to 10, or 10 or more codons, at one or more sites in a DNA sequence 
encoding a binding protein, make an expression construct and introduce the expression 

10 construct into a unicellular microorganism, and develop a library of modified sequences. 
One can then screen the library for binding affinity of the encoded polypeptides to one or 
more ligands. The best affinity sequences which are compatible with the cells into which 
they would be introduced can then be used as the ligand binding domain for a given ligand. 
The ligand may be evaluated with the desired host cells to determine the level of binding of 

15 the iigand to endogenous proteins. A binding profile may be detemined for each such 

ligand which compares ligand binding affinity for the modified ligand binding domain to the 
affinity for endogenous proteins. Those ligands which have the best binding profile could 
then be used as the ligand. Phage display techniques, as a non-limiting example, can be 
used in carrying out the foregoing. 

20 In other embodiments, antibody subunits, e.g. heavy or light chain, particularly 

fragments, more particularly all or part of the variable region, or single chain antibodies, can 
be used as the ligand binding domain. Antibodies can be prepared against haptens v/hich 
are pharmaceutically acceptable and the individual antibody subunits screened for binding 
affinity. cDNA encoding the antibody subunits can be isolated and modified by deletion of 

25 the constant region, portions of the variable region, mutagenesis of the variable region, or 
the like, to obtain a binding protein domain that has the appropriate affinity for the ligand. 
in this way, almost any physiologically acceptable hapten can be employed as the iigand. 
Instead of antibody units, natural receptors can be employed, especially where the 
binding domain is known. In some embodiments of the invention, a fusion protein 

30 comprises more than one ligand binding domain. For example, a DNA binding domain can 
be linked to 2, 3 or 4 or more ligand binding domains. The presence of multiple ligand 
binding domains means that ligand-mediated cross-linking can recruit multiple fusion 
proteins containing transcription activation domains to the DNA binding domain-containing 
fusion protein. 

35 

Allostery-based systems 

As mentioned previously, systems for transcription regulation based on ligand- 
dependent allosteric changes in a chimeric transcription factor are also useful in practicing 
the subject invention. One such system employs a deletion mutant of the human 
40 progesterone receptor which no longer binds progesterone or other endogenous steroids 
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but can be activated by the oratiy active progesterone antagonist RU486, described, e.g., 
in Wang et a!. (1994) Proc. Natl. Acad. Sci. U.S.A. 91:8180. Activation was demonstrated 
in cells transplanted into mice using doses of RU486 (5-50 g/kg) considerably below the 
usual dose for inducing abortion in humans (10 mgA<g). However, the reported induction 
ratio in culture and in animals was rather low. 

Another such system is the ecdysone inducible system. Early work demonstrated 
that fusing the Drosophila steroid ecdysone (Ec) receptor (EcR) Ec- binding domain to 
heterologous DNA binding and activation domains, such as E. coli lexA and herpesvirus 
VP1 6 permits ecdysone-dependent activation of target genes downstream of appropriate 
binding sites (Chhstopherson et al. (1992) Proc. Natl. Acad, Sci. U.SA 89:6314). An 
improved ecdysone regulation system has been developed, using the DNA binding 
domain of the EcR itself. In this system, the regulating transcription factor is provided as 
two proteins: (1) a truncated, mutant EcR fused to herpes VP16 and (2) the mammalian 
homolog (RXR) of Ultraspiracle protein (USP), which heterodimerizes with the EcR (No et 
al. (1996) Proc. Natl. Acad. Sci. U.S.A. 93:3346). In this system, because the DNA 
binding domain was also recognized by a human receptor (the human famesoid X 
receptor), it was altered to a site recognized only by the mutant EcR. Thus, the invention 
provides an ecdysone inducible system, in which a truncated mutant EcR is fused to at 
least one subunit of a transcription activator of the invention. The transcription factor further 
comprises USP, thereby providing high level induction of transcription of a target 
genehaving the EcR target sequence, dependent on the presence of ecdysone. 

In another approach, the inducible system comprises or is derived from the 
E. coli tet repressor (TetR), which binds to tet operator (tetO) sequences upstream of 
target genes. In the presence of tetracycline, or a tetracycline analog which bind to tetR, 
DNA binding is abolished and thus transactivation is abolished. This system, in which the 
TetR had previously been linked to transcription activation domains, e.g. from VP1 6, is 
generally referred to as an allosteric "off-switch" described by Gossen and Bujard (Proc. 
Natl. Acad. Sci. U.S.A. (1992) 89:5547) and in U.S. Patents 5,464,758; 5.650.298; and 
5,589,362 by Bujard et al. Target gene expression is reportedly regulatable over several 
orders of magnitude in a reversible manner. This system is said to provide low 
background and relatively high target gene expression in the absence of tetracycline or an 
analog. The invention described herein provides a method for obtaining even stronger 
transcription induction of a target gene, which is regulatable by the tetracycline system or 
other inducible DNA binding domain. 

In some embodiments, a "reverse" Tet system is used, again based on a DNA 
binding domain that is a mutant of the E. coli TetR, but which binds to TetO in the 
presence of Tet. Additional information on mutated tetR-based systems is provided above 
and in patent documents cited previously. The use of bundling as described herein 
provides a method for obtaining even stronger transcription induction of a target gene in the 
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presence of tetracycline or an analog thereof from a very iow background in the absence of 
tetracycline. 

A tetR domain useful in the practice of this invention may comprise a naturally 
occurring peptide sequence of a tetR of any of the various classes (e.g. class A, B, C, D 

5 or E) (in which case the absence of the ligand stimulates target gene transcription), or more 
preferably, comprises a mutated tetR which is denved from a naturally occurring sequence 
from which it diflers by at least one amino acid substitution, addition or deletion. Of 
particular interest are those mutated tetR domains in which the presence of the ligand 
stimulates binding to the TetO sequence, usually to induce target gene transcription in a 

10 cell engineered in accordance with this invention. For example, mutated tetR domains 
include mutated Tn10-denved tetR domains having an amino acid substitution at one or 
more of amino acid positions 71 , 95, 101 and 102. By way of further illustration, one 
mutated tetR comprises amino acids 1 - 207 of the Tnl 0 tetR in which glutamic acid 71 is 
changed to lysine, aspartic acid 95 is changed to asparagine, leucine 101 is changed to 

15 serine and glycine 1 02 is changed to aspartic acid. Ligands include tetracycline and a wide 
variety of analogs and mimics of tetracycline, including for example, anhydrotetracydine 
and doxycycltne. Target gene constmcts in these embodiments contain a target gene 
operably linked to an expression control sequence including one or more copies of a DNA 
sequence recognized by the tetR of interest, including for example, an upstream activator 

20 sequence for the appropriate tet operator. See e.g. US Patent No. 5,654.1 68. 

Ligands of the invention 

In various embodiments where a ligand binding domain for the ligand is 
endogenous to the cells to be engineered, it is often desirable to alter the peptide 

25 sequence of the ligand binding domain and to use a ligand which discriminates between the 
endogenous and engineered ligand binding domains. Such a ligand should bind 
preferentially to the engineered ligand binding domain relative to a naturally occurring 
peptide sequence, e.g., from which the modified domain was denved. This approach can 
avoid untoward intrinsic activities of the ligand. Significant guidance and illustrative 

30 examples toward that end are provided in the various references cited herein. 

CrosS'linking/dimerization systems 

Any ligand for which a binding protein or ligand binding domain is known or can be 
identiified may be used in combination with such a ligand binding domain in carrying out this 
35 invention. 

Extensive guidance and examples are provided in WO 94/18317 for ligands and 
other components useful for cross-linked oligomerization-based systems. Systems based 
on ligands for an immunophilin such as FKBP, a cyclophilin, and/or FRB domain are of 
special interest. Illustrative examples of ligand binding domain/ligand pairs that may be 
40 used for cross-linking include, but are not limited to: FKBP/FK1 012, FKBP/synthetic 
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divaient FKBP ligands (see WO 96/06097 and WO 97/31898). FRB/rapamycin or analogs 
thereofiFKBP (see e.g.. WO 93/33052. WO 96/41865 and Rivera et al, "A humanized 
system for pharmacologic control of gene expression", Nature Medicine 2(9):1 028-1 032 
(1997)). cyclophiiin/cyclosporin (see e.g. WO 94/18317), FKBP/FKCsA/cyclophilin (see 
e.g. Belshaw et al, 1996, PNAS 93:4604-4607), DHFR/methotrexate (see e.g. Licitra et al, 
1995. Proc. Natl. Acad. Sci. USA 93:12817-12821), and DNA gyrase/coumermycin (see 
e.g. Farraret al. 1996, Nature 383:178-181). Numerous variations and modifications to 
ligands and ligand binding domains, as well as methodologies tor designing, selecting 
and/or characterizing them, which may be adapted to the present invention are disclosed in 
tf^e cited references. 

Allosiery-based systems 

For additional guidance on ligands for other systems which may be adapted to this 
invention, see e.g. (Gossen and Bujard Proc. Natl. Acad. Sci. U.S.A. 1992 89:5547, and 
US Patent Nos. 5654168, 5650298, 5589362 and 5464758 (TetR/tetracycline), Wang et al. 
1994, Proc. Natl. Acad, Sci. USA 91:8180-8184 (progesterone receptor/RU486), and No et 
al, 1996, Proc. Natl. Acad. Sci. USA 93:3346-3351 (ecdysone receptor/ecdysone). 



DNA-binding domains 

Regulated expression systems relevant to this invention involve the use of a 
protein containing a DNA binding domain to selectively target a desired gene for 
expression (or repression). Systems based on ligand-mediated cross-linking generally rely 
upon a fusion protein containing the DNA binding domain together with one or more ligand 
binding domains. One general advantage of such systems is that they are particularly 
modular in nature and lend themselves to a wide variety of design choices. These systems 
permit wide latitude-in the choice of DNA binding domains. Many aliostery-based 
systems, like the TetR- and progesterone-R-based systems, use a fusion protein 
containing a DNA binding domain together with a transcription regulatory domain (e.g. a 
transcription activation or repression domain). Some allostery-based systems such as the 
ecdysone-regulated system, use a protein like RXR which contains a DNA binding domain 
together with a binding site for another protein (such as the ecdysone receptor).<Of the 
allostery'-based systems, the progesterone receptor-based system and like systems 
permit relatively greater latitude in the choice of DNA binding domain. While allosten/-based 
systems like the TetR- and ecdysone receptor type may be engineered at the DNA 
binding domain, they are somewhat less amenable to ready replacement of the DNA 
binding domain. 

Various DNA binding domains may be incorporated into the design of fusion 
proteins of this invention, especially those of the ligand-mediated cross-linking type and 
the progesterone-R-based type, so long as a corresponding DNA "recognition" sequence 
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is known, or can be identified, to which the domain is capable of binding. One or more 
copies of' the recognition sequence are incorporated into, or present within, the expression 
control sequence of the target gene construct. Peptide sequence of human origin is often 
preferred, where available, for uses in human gene therapy. Composite DNA binding 
domains provide one means for achieving novel sequence specificity for the protein-DN.'\ 
binding interaction. An illustrative composite DNA binding domain containing component 
peptide sequences of human origin is ZFHD-1 which is described in detail below. 
Individual DNA-binding domains may be further modified by mutagenesis to decrease, 
increase, or change the recognition specificity of DNA binding. These modifications can be 
achieved by rational design of substitutions in positions known to contribute to DNA 
recognition (often based on homology to related proteins for which explicit structural data 
are available). 

For example, in the case of a homeodomain, substitutions can be made in amino 
acids in the N-terminal arm, first loop, second helix, and third helix known to contact DNA. In 
zinc fingers, substitutions can be made at selected positions in the DNA recognition helix. 
Alternatively, random methods, such as selection from a phage display libran/ can be used 
to identify altered domains with increased affinity or altered specificity. 

For additional examples, information and guidance on designing, mutating, selecting, 
combining and characterizing DNA binding domains, see, e.g., Pomerantz JL, Wolfe SA, 
Pabo CO, Structure-based design of a dimeric zinc finger protein Biochemistry 1998 Jan 
27;37(4):965-970; Kim J-S and Pabo CO, Getting a Handhold on DNA: Design of Poly- 
Zinc Finger Proteins with Femtomclar Dissociation Constants, PNAS USA, 1998 Ma: 
17;95(6):2812-2817; Kim JS, Pabo CO, Transcriptional repression by zinc finger 
peptides. Exploring the potential for applications in gene therapy. , J Biol Chem 1997 Nov 
21;272(47):29795-29800; Greisman HA, Pabo CO . A general strategy for selecting high- 
affinity zinc finger proteins for diverse DNA target sites, Science 1 997 Jan 
31;275(5300):657-661; Rebar EJ, Greisman HA, Pabo CO, Phage display methods for 
selecting zinc finger proteins with novel DNA-binding specificities, Methods Enzymol 
1 996:267:129-149; Pomerantz JL, Pabo CO, Sharp PA , Analysis of homeodomain 
function by structure-based design of a transcription factor. Proc Natl Acad Sci U S A 1995 
Oct 10;92(21):9752-9756; Rebar EJ, Pabo CO. Zinc finger phage: affinity selection of 
fingers with new DNA-binding specificities. Science 1994. Feb 4;263:671-673; Choo Y, 
Sanches-Garcia I, Klug A, In vivo repression by a site-specific DNA-binding protein 
designed against an oncogenic sequence, Nature 1994, Dec 15;372:642-645; Choo Y, 
Klug A, Toward a code for the interaction of zinc fingers with DNA: Selection of randomized 
fingers displayed on phage, PNAS USA, Nov 1 994; 91 :1 1 1 63-1 1 1 67; Wu H, Yang W-P, 
Barbas CF 111, Building zinc fingers by selection: toward a therapeutic application, PNAS 
USA January 1995; 92:344-348; Jamieson AC, Kim S-H. Wells JA, In Vitro selection of 
zinc fingers with altered DNA-binding specificity. Biochemistry 1994. 33:5689-5695; 
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,„,erna.iona, pa>»m app.,=a.ions WO 9«!093, , WO 94„ 83,7, WO 96,06,66 and WO 

95/1 9431 ; and USSN 60/08481 9. 

Additional domains and linkers 

' ' JL^ino a..s, .,e,.a ,o aa a ^^^^^^^^^^^ 

Bustosetal {1991)BiochimicaetBiophysicaActa1071.83 10l).unei 

red ,0. L NLS o, ,.e SV40 ,a,9e T an.,9en wN* .s =7»";i':,.,„,, 
p,o,ine-l,slt>e-lyslne.|ysl-,e.a,ginine.lyslne-valiaa (Kalderon a. al. (1984) Call 39^499 509). 

, J.03.aL NLS is *,ived .-0. a p33 p,o,e.. '"^^'^^f^^^^^ZTZ o. 
lerminal nuclaar localization signals, compnsing rescues 31 6-325, 369 375 ana 3/ 
sT snaulsK, e, al. ,1990) Mol. Oall. Biol. 10:6565-65^7). Olher .L s are descnbed b, 
Shaulsky e( al (1 990) supra and Shaulak, et al. (1 991) Oncogene 6.2056. 

X laciliL Jei, deleCon and,or pudlicaticn, ,he fusion pro^ins con,™ 
30 peplide ponlons suc^ as -histidlne Ugs". a glu,a«,ione-S..,ans,erase don«,n o, an 

and their use in fusion proteins are known. See e.g. Huston et al. (^^SS) P'^^^^^^ 
U.S. Patent No. 5.091.513; and Richardson et al. (1988) Scenes 240.1648-1652. 

" '^^^^^g:t;— uctco^prisesageneofint^^^^^ ^ ^ 

exp— rs;uence which permits iigand-reguiated exp^ss.o^^^^^^^^^^^^ More 
specifically, such a construct typically comprises: (1) one or eln (or by 

sequence recognized by a DNA binding dor^ain of a fus,on prote, of he — 
35 a DNA binding protein like RXR which binds to a fusion prote.n of the ^^^'^^if 
remoter seq ence consisting minimally of a TATA box and inifator sequence u 
ptlaliy in luding other transcription factor binding sites; (3) sequence encod n the 
d : product, including sequences that promote the initiation and te— ^^^^^^^^ 
translation, if appropnate; (4) an optional sequence 
40 acceptor, and intervening intron DNA; and (5) a sequence d.rectmg cleavage 
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, H ,n„on of the resulting RNA transcnpt. Typically the construct contams a copy o( 
polyadenylato. of the re g ^^^^^^ ^^^^^^^.^^ ^^^^^^^^ 

responsive io me transcrlplion laaor. 

Targe, genes ,^,,,3 
encode a .lie.apeulic p-o.e.n, ' , ™, bo 

™"*r':'^:rr e I^^^^^^ „Lp,L.spann:ng p,o,e,n. a sec.ed p,o,e,p, o. a 

r;o:r:c;:r;e.o.o,nsw..--ee.p^...^^^^^^^^^^^^^^ 

rr;C"-^e.gepeU.eo.pa«a,oocu.^ 

peplido sequence. hormones, suoh as insulin, human growth 

growth factors, such as EGF, IGh i ' J ^^^^ factors, nerve growth factors. 

FGF. erythropoietin, thror.bopo,et,n, "^^gakaryocyt c g 
e.c.; proteins Which stimulateorinhibitang.^^^^^^^^^^^ 

protein a and su„aoe men,bn,ne protein o, a p«e,n 

The gene can encode a naiuiuny y tr^ncmembrane sequence. 

..desob,,ntroduc.iono,anapprophatesgna,pep«ea„d^ 

Va.dPssucn.oteins«^^^^^^^^^ 

„,arkers. e.gXD3 CD4 c , .^^eptors, such as the interleukin 

CD10CDt9^CDJ8^CD^3,™^ 

receptors 1L-2R, 1L-4H, ere, rt^o h .^^ontnr^ and soluble forms of such 

--^=;r:— ^^^^^^^^^^ 

proxima, o « ^ ^ ,„ eotgl apparatus (o, paOaging in a 

.„ Te:;! cr- ^^^^^^^^^^ 
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one 0, bo.h Chains o T- ec^p» ^^^^^^^ ^^^^^ ^ ^ ^^^^^^^^ 

Chains 10, an "™"°9lob* Jo, s^^^ „ .g^ms. inlecion. b, 

oaacial cell wall p,oleins, .a,ious anti-v.al P"«.<^^- 3,,, 

l„ .a,ious si.ua.ions. one ma, »sh .0 di,ec. a » "J^^^^^^^^ ,„ng „, 
incwa analonica, si.es. such as lymph nodes, '^^''^"'^^.^".^^ ^^^J 
o.h,,,n.=,na, o,3ans o,.nnc,ion. *s such«co«^^^^^^^^ 

™nlpaa.ion. innammaBon, infeCon * ''f^^'^'^;^^^^,,^ a me.hoci 

*h,eco,nizeso,«s,o.he^«^^^^^^^^^ 

"*':iro.:r:: — 

::r:=-e,c.P,o..^o,in.e,esU^^^^^^^ 

:Spr:rrrp:=^^ 

. - , fnr HNA bindinQ domain of one of the 

~„«d,e,u.a.iono,.he,,ansc.p.iono,,neendo,^^^^^ 

(b) Minimal Promoters. , ^ . , opne construct (or other 

, -.a,pro.otersw«^ 

construct of the .nvent.on) may be sele^ed fro ^^^^ ^^^^^^ 

includ,ng promoter regions from fos, hCMV ^VJ an^ ^.^,^3, ^,,3 promoter 

examples are provided which use a m,n,mal ^^^^1"^^''°^ 6-3042-3049, 1986) 
(.72 to .45 with respect to the start site; S,ebenl,st et al.. MCB 6.30 

(c) DNA recognition sequences. ^^^^ 
The choice of recognition sequences to use ,n the target gen 
cases determined by the nature of the regulator system to be employed. 
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:r::C--«..«2— ^^^^^^ 

binding domain). crosslinking systems and systems like th. 

Recognition sequences i -de Y ^^^^^ 
DNA recognition sequences for other DNA bnd g ^^^^^^^^^^^ 
expenmellly. In the case oi a co-PO^- ^^^.^^^ manipulated to d.ect 

sequences car. be determined -Pen- n^'^V- ^ .cognition sequence 

he,r speciiicity toward a desired sequence. ^^^^^ spanning at least 

Laclposite CNA binding d—^^^^ 

ten, preferably eleven, n.ore P^^*^^^^^ j^.^'^^^ .,,3 ^p,tative or demonstrated) w.th>n 
cases eighteen bases. The ^^^^^^^^ou. they .ay be interspersed w 
the nucleotide sequence need not be tul^ y ^^.^ p^^^^,, ^ut rather 

^ oe. base pairs that need not be ^re^ ^^^^^^^^ 

r^r^engineeredON^^^^^^^^^^ 

TO identify a nucleotide -Quence t^a - J^^^^ (,.,3ociat,on 
containlngaDNA-blndlng region. pref.aby^^^^^ 

,3 constant10-11 M or lower are espec ally p^^^^^^^^^^^ 
affinityblndingsltesforindWidualsuo^^^^^^ 

already known, then these sequences " ^elow for methods for 

Ithl optimum conftgurationdeterm.^^^^^^^ 

determining aff.ni.es). ^^^^^^'^'^''^^^^^^ DNA sequences by adapta >on of 
,0 complexcanbeselectedfromalargepoolof ra ^ 33,3.,tlve n^ethod for the 

" Zhed methods (Pollock, a and 1^^- ^^^.^^ ,3. e.97-6204). Bound 
determination 0, protein-DNA b,nd,ng =P^^^^^ '^^^^^^ ^ and affinity for the 

sequencesare cloned intoap—^^^^ 

proteins are determined. From th>s col ed on ^ tor 
appropriate level of affinity and specificity. 



36 



PCT/US98/177:3 



WO 99/10510 



• c of a DNA recognition 



GAL4 or for ZFHDI 



„,3ig„ assembly 0. ™' ^"^ ,.e p*=.pl3S. «*e 

examples a.d ma.e.ials and me«ds '^"""^^^^J^,,^^ as descriPed. 
e i Cited he*, «»P mpd«ca«oas ''^'^^ll„>>or,.^ „a,s, wPere me cod,n9 

such uses are well known .n the art. va 

>5 available. 

introduction of Constructs into CeUs ,3,13 and in 

This invention is particularly useful for ^ 9 ^,-,^^1 cells may be 

^^^^ 

: yUde an, o, .Pe nu.ea.eP =* ^J^,^,,,, and ,iP-*las.s. «so o. 
„mpPoldo,m,etompn»y<.c wages, as »e 
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„«,es, a,s s,en a..-P»s.n,,o, co,s. s.* as h,™,opoi*, neural, slro™,, ™s=.e, 
*pa,lc. pul^ona^, *»o,p,es,ina, and ,,,3 ,„ 

.ne c* ,na, pe ^-^z:::^::^^^^^^^- 

some cases, xenogeneic cells With respect to an .■mmp"^ nrofile bv 

,„a*a,in9 B2™,ogloPa.n » p,pvep, .Pe >^'^'"^ '^^^^^^ Z. MHC n,o,ecules, 

,enas ass^.^P .a, Pe o, ,n«es,. »n.e ,Pe 

„,,pr:=CsrasTceLp.Bo*na».aspec,,,cap,,. 

,„.„,-r:::«P.Pcel,o.eoe,,sa3Pne.— ^^^^^^ 

P„„s.p».s, IP n,ap, pasas in --'^^fj^ ^ ™:r: ^ pa,ed in pon.en.lppa, 

host cells Which contain the construct(s). The constructs can y h 

:a:l.«opaipgsa,„epcesapd.,«^^^^ 

-™^tpP..Pc,s,pnPPCP.p,e.^^^^^^^^^^^^^^^^^ 
sepuencas may then be inlroduped in.p a host pe« by any 

PC stn^ots may be Incp-porated '"'-f- -^"'^te TottS p^^-^^^^ 
EBV veptpp.) Pr into veptp-s designed to, ,n.egrat,pn * "^'^^^^^^^^^ „„,3 
e„„s,.P.sma,bein,eg,atedandpaP^ge^*n^^^^^^^^^ 

like Adenovirus, Adeno-assooated Virus (AAV), or n rp . .^^^ively the 

,o,udingretro.ra, vectors, torinfectionor.^^^^^^^^^^ 
3 construct may be introduced by protoplast fus.on. ^^^^^^^^ ^^^^ ^^,,3 

phosphate transfection, lipofection, --^f °' the construct(s). 
some cases be grown and expanded in culture oe 

intea ration of 

followed by the appropriate treatment lor '"^-^-^^^^^^^^^ of a marker 

the construct(s). The cells will then be expanded and ^^^"^^^^^^^ 
, presentintheconstructs.Variousmarkerswh»^^^^^ 

neomycin resistance, thymidine kinase. ^^VS-^V-" ^^1"^^^^^^^^^ 
surface markers such as Tac, CDS, CDS. Thy1 ^ ^^^^^^^^^^^ 

,n some instances, one may have a target s,te homo^o" 
.,ereit,sdesiredthatac.nstructbeintegrat^atap^^^^^^^^^^^ 

,0 delete and/or replace an endogenous gene (at the same 
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. . nonotn ,rt ot Ihis invention. For homologous recombination, one may 
rprnmhinant larqet construci oi iiiib nivctiv.vy ^ ^^h/-ior7\ 

enet; use eLr a or O-vectors, See, ,or example, Thomas and Capecch. C^^ 987) 
51 , 503 512; Mansour, et a/., Nature (1 988) 336, 348-352; and Joyner, et a/.. Nature 

' '''^ The — may be ,ntroduced as a s.ngie DNA molecule encoding all o, the 
genes o d, rent DNA molecules hav,ng one or more genes. The constructs rnay be 
lduceds,multaneous,yorconsecut,ve,,eachw.,hthesameord«^^^^^ 

Vectors containing useful elements such as bacterial or yeast ongins of replication, 
selectable and/or amplifiable markers, promoter/enhancer "J^^^^^^^^^^^^^^ 
nrokarvotes or eukaryotes, and mammalian expression control elements, etc. which may 
be u^d to prepare sicks of construct DNAs and lor carding out transiections are well 
known in the art. and many are commercially available. 

Introduction of Constructs into Animals 

A V means lor ,he W-<xiu*n ol gen*a., engineered c* or heterdc^ous DNA 
■ , ZlbW rriammals human or no„.t,un«n, ma, be adapted to the practice of 
T """"l ^TZZ^L various DNA constmcts Into the intended recipfen,^ For 

be referred to as the transgene. 
'''°'re,fs::"e:n,ransduc^ex.,« 

.e grown .rclre under selective conditions and ce.s Which are se,^^^^^^^^^^^ 
desired construct(s) a,ay then be expanded and tunher analyzed, "'J^; * 

rhrr:icroro=^^^^^^ 

grown in culture or introduced into a host organism. 

Depending upon the nature of the cells, the cells may be introduced into a hos 
ni.m e 0 a mammal in a wide variety of ways, generally by injection or .mplantation 

:^ irtir ^^^^^ 

the cells to their intended destination. Illustrative sites for injection or implantation inc ude he 
vascular system, bone marrow, muscle, liver, cranium or spinal cord, per one^^^^^^^^^^^ sk,n. 
3 Hematopoietic cells, for example, may be administered by injection into the vascu la 

,010 cells. The number of cells which are employed will depend upon the circumstances, 
the purpose for the introduc^on. the lifetime of the cells, the protocol to be used, or 
xamp e. the number Of administrations, the ability of the cea^ 
,0 he the^peutic agent, the physiologic need for the therapeutic agent, and the like. 
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Generally lor myoblasts or „broblas.s for example, the nur^ber of cells will be at leas, 
lou, 104 ar.d r.ot more than about 109 and may be applied as a d,spers:on, generall 
tZ Nected a. or near the s.te of .nterest. The cells .ill usually be in a phys.log.cally 

H„« . IS Patent No 5 314,171 (Baxter tnternalional, Inc); Uludag and Sefton. 1 993, J 
Host. US ,;„o).12 3-21 (HepG2 cellsfltydroxyelhyl metltactylale-methyl 

Biomed. Maten Res. ^'^OM™ 2 P ^ ^ 

t:iT'!rctDts;;r,.3,a,gLt,,;Taia^ 

:ll',,.or,s:xpteL!n,n3«a,.na,e-po,,.t.,ys,ne.^ 

Ttansptantatton Ptdc. 27,e,.3«, 3350 Wna.e , '^^^^ -JJ^^ 
Transplantatidn Proc. 27(6);3389 (algtnale). Lakey et at. 1996. "^ J 
27,6,:3266 (alginate); Ko^ « a,. 1995. fj^^'f '^^^^^^ 

Del e. at. US Paten. Mo. 5.429.82, (alginate); EmencI, e a 993 Ne- 

:r:.edL.a.3pina,s— 

(encaps...ed .y..d™ias P™-^ — r^utr^ ^^^3 engineeted 

elodiments the semipermeable encapsulation renders the e 

•,.muno.ogical.y isolated from the host organism in wh.ch ^']^^^^^^^ 
introduced, in those embodiments the cells to be encaps^^^^^^^^^^^ 

fusion proteins containing component doma.ns derived from P^°*^'"^ °' ' 
and/or from viral proteins or proteins from species other than the host speces. The 
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lormulated DNA. Prior to fornnulat.on o ^NA e^, - ^ ^^^^^^^^^ ^^^^.^^ 

to incorporatior. In a final -P--^;;;: j^^^^^^^^^^ ,or expression (e.g., 

desired DNA constructs may r.^^^^^^^^^^^ 

inclusion of an intron .n the 5 ""^^,^"f 7:'^.-. ^995) Formulation of DNA. e.g. 
sequences (Feigner, et a... Ann NY Acad Sc. ^6-1 39. W ^^^^^^ 
with various lipid or,iposomematenals.mayt^^^^ 

materials and delivered to the recp.ent ^^o^^^^^ recombinant human alphat - 

cell Mol Biol 10:24-29. 1994 (in ^^^^j;^^^^^^^^^^^^ of rabbits); Tsan et al. Am J 
antitrypsin gene complexed to cat.0^^^^^^^^^^^^^^ 

Physio, 268 (Lung Cel. L cationic liposomes); Alton 

via tracheal insufflation of plasm.d DNA alone 0 P ^^^^^.^^^ 
etal..Nat3enet. 5:135-142.1993 g^^^^^^^^^^^^^ 
efcDNA.liposon.ecomple^^^^^^^^^^^^^ 

DNA can be earned o^^V -^-^^^'^ .^^^^ ,3,„,e, or by nebulizat.on. 

Ringer's, phosphate buffered saKne. or 0 adenovirus, adeno- 

viral systems include those based on ^'^^^^^^^^^^^^^^^^ 3,,,^ ,or 

associatedvirus.hybridadeno-AAV..ent...s^^^^^^^^ 

transduction by infection, and in some --^^ '^"^^ ,3„. ,oad. Sci. USA 81 , 
host genome. See. for example, ^^^.^..^ et al. (1989) Proc. Natl. 

7529-7533; Kaneda et al.. (^^f ^ ^^^^^^^^^^ b 0 . Chem. 265. 17285-17293 

,0 Acad. sci. USA 86, 3594-3598; Hatzog^ et a,^ (1 990 ^. 

and per., et al. (1991) inhalation, orother 

administered by in,ect.on (e.g. intravasculany ^f the DNA via 

parenteral r.ode. Non-v.ral .ay also be used. See e.g. 

complexes with liposomes or by '"l^^^^"' ^J^^^^^^^ ^.^^p.e, for additional 

3s WO 96/41 865. PCT/US97/22454 and USSN 60/0^81 9-J ^^^^^.^^^ 

— r::— ^^^^^ 
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useful in gene therapy. ,,,^-^015(18 ;he recombinant nucleic acids m 

vecws, «us. lipid lo™*.ion or ome' O'rn) "^^^^^^^^^i p,epaBI»n ol .he gene 
o, a number o, Known n,e,..OS, For „* ^ i„,3„i„„, i„.,«,ion, 
«.e^ s,3>er„ can be in.rcOuced ^^^^^l^,.,. or seiec.ve .ransducrron 
e,c jn sorm sys.ems. .be means ol deliver, P'"*= J ,egional or iocai 

o. — , "-trr;:':™ r;"rerriniec.ion, c... . 

administration (see U.S. Potent 5.3.a ) ^^^^^ i^3,3^ce, 

,,,.,P.ASUS«.:30S«OS,orb,^^^^^^^^^^ 

— rtr;r::r 

delivery of various ligands to cells in vitro and to organ.srn. ^^^^^^^^^ 

„ preferred embodiments of the — he ^^^^^^ 

derived by incorporation 0I tbe genet,c ^^^^^^^^^^ ,,,3 (AAV), hybrid 

including a recombinant retrovirus, ^'^^'''^i^Xo^^^^^^^^^ 

adenovirus/AAV. herpes .rus or lent,v.ms ( 1 houg .^^^ ^^^^,,3 ,e 

using recombinant bacteria, or euKary^d^^^^^^^^^ 

used in the practice of this mvent,on. AAV- and adeno ^^^^ ^^^^^^ 

systems), whether ex vivo or in vivo. 
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Adenoviral vectors invention utilizes adenovirus- 

A viral gene delivery system useful ,n ^ " ^^^^,.^3, , 36 kB, linear and 
derived vectors. Knowledge of the genet. -9-'- "J f;;;,, adenoviral DNA with 
double-stranded DNA virus, allows subst.tut.on of a -9 P - J 

.oreign sequences up to S KB. In ^^^;^^ :^^^e.o..^^ ONA can replicate 
host cells does not result in chromosomal 'ntegrat-on be^^ adenoviruses are structurally 
in an episomal manner without potential Senotoxic.iy , 

extensive ampliiica^'ori. 

stable, and no genome rearrangement has been detected 
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(1991) Mol. cell Biol. 

«7.6141-6145; Huber et al., (1^^^ 3^,55^06 254:1802 ■ 3.541-547; 

, al., (1992) PNAS USA 89.76 395. hwu el a - (1993^ 

Da. etal., (1992) P^^AS USA Patent No. 4.980 286 _PCT^^^^^^^^ p,, 

4,15; U.S. Patent PCT AppV-oafon WO 

89/07136-, PCT Appl'cation WU ^^^^^ ^, 

Furthermore, it has oee ..^^ased vectors, by moa.iy ^ plications 

.trovirusesanaconse^uen^^^^^^^^^^ 

packaging proteins on th^^^^^^^^^ '"^^Tl 0^^^^^^^^ 

W093/25234, WO94/06920 ^n^ ^,,,,,3 >nciude^ c up 

n^cKiification d the infection spectrum ^^^^.^ ^^^^^ 3, (198 ^^^^ 

;ecitictorceii surface ant,ge^^^^^^^ 

86:9079-9083; Julan et ai.. (1 992) ^.^^^^^ en p 

Viroiogv163-.251-254); --;pXu^46). Coupling can be .n t^ J^^^^^^^ 
,Ul991)^.B>olXhem^66.^^^^^^ 

cross-linking with a Pro^^'";;°^ .ting lusion prote. ^^^^ l^^^^^, direct the 
asiaioglycoprotein).asw -^VJ^^^^^^^^^ 

antibody/envtusionP^^^^^^^^^^^^ 
infection to certain tissue w 

an amphotropic vector. 
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ne therapy have been 

0,.er Viral Systems ^^^^ :P^""T;arnt No. 5,631 ,236 by Woo 

Other viral vector sy ^ ^,^^3 (U.S. .Mammalian 
,envedtromherp.virus, ..^2^^ 

al., issued May 2 ■ , p oenhardt D T ed^V ^^^^^^ ^,986) 

expression vectors la 9 g^nerworth,, Ba.ch 
cloning vectors and their uses. 
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Trnn<ient and stable 

„, ,„ pa«.a., ^-^-r™;;, --as s,s>a. an ^ J, 

,«,u,as ,0, va,« s„,.en, »86. -P- =' ' 

W,th the recent recognition o ^.^^^ sequences. In vnro 

1990, supra). 

"""""" a.ga PO''"^ »' 1 were pa-Wa* a««e 
properties ior liver^lirected ^^-^J' ^.to duck hepa.Us B v,r s 9 

HTpatology,14:l24A). 
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---n;;r;r:Spa«aaa,e..^^^^^ 

ottervisa t«™"'='«°''/; °' ,,,,ap,. This ca. ba accompWed b, 

.i, ;„ vivo, l«9.<Ji.ec.a* 9="^ ^J' ^^^^^^^^^^ „a bronchoscopy. Rac n » ^ 

,„c,a«n.b«-*^^^^^^^^^^ 

— Taro-a-CPTna.— 
r;:;tasaP.-.sw Hbnoaoa.e.o»ce«ec 

transfect cells v..th.n ^^^^JJ^^ ^ the alrv^ays and acce. 

epithelial cells, leukocytes res.d.ng 



52 



10 



15 



PCT/US98/n723 

WO 99/10510 

observable ligand-resp Averse effects. ■ ^, oi factors, as discussed 

..erapeut. .er.ei,t w-t^o^ -^^^^^ ^,,3 deper^ds on a var.e. 1 ;^^^,p,,,ea,iV 
Optimal dosages of DNA ^^.^^^ p^i.ent. Aga 

elective doses 0I vuuse ^^^^^^^^^^^^^^ about 1 X 107 to a 

saline solution containing ^^^^^^ 
e.g. iront 1 X 108 to 1 X O P ^^^^^ ^^^^^^^ a e 3 ^^^^^ 

,n a preferred e-^^^^ "^.^'^.^coding the fusion P-^^^^^^^^^,,, , ^.ay be 
..al panicles containing nude c ds ^^^^^^^^ " T hose encoding 

desirable to a^-'-^^^^^^ ;;,,3 include 1 -.3, VA V.10. ratios of viral 

^^ethods of the , or engineering cells t^^^^^^^^^^^^ 

The invention provides me ^ ^^^^^ -^^e cells m y 

,,,,,„ediatedregu^^^^^^^^^^ 

1 (ex vivo) or in ^,Hch may be of na - ^^^^^^ 

endogenous gene or an J occurring pepfde ^'^'''^!^^^,,^as or 

sequence, or may ccnta. ^-^^l^^.re^o.e. more ^^^^^Ze^^^o engineering 
rmpnsesintroducingintohec*^^^^^ 

^.positionsofthisinveo^o-B^^-^^^^^ 

of cells or animals (e.g.. ^.„,,,,i,g o t^e ^-1"^^'°" °^ 

.on^alorpatbo^g-.^^^^^^^^^^ 

or characterization of gen ^^^^^ ^^^^^,,3 delude 

functioning, --^^"'^J ,wo or ex vivo. , ,ells, or organisms 

human subiects, whether in v ^ .^^ 3 , elsewhere herein as well 

The invention also P^^l''' mentioned above and d e 

con^-'"^^--^°^TThrm^-^ 
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r:en,n.e,^.o-.— ^^^^^^ 

bacena. laseo. or ,east rfs, '"''■^f , noa-r>ar»,a,,an '^'^^P 

p,^„„9P-o..n,»*^^^^^^ 

e„9i„«red cells. Thus ^,p„3sion systems, n may ^ 

human subiect. in e.mer case. ^ P^^^ ^^^^^^ ^^.^,,3, can be 0 a P 

3, expression of a target gene. In jase ^^^^^ ^ ,,,ety of res a ^ 

veLinaw treatment of the -- or to - ^^^^ , ,,,apeut,c 
purposes, in the case of human subiects.t 

. ._„^trr^ont nroaram. 



purposes, m - — 
prophylactic treatment program. 
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10 



rnaches For example, the 
target gene to be regu ate ca ^, ,. , ^^^^^^ 

e«p essicn 0, one e: n,o,e gene ^ nu*e, ony'nPn 

,nU*o.o,.<r,nve-.n^^^^^^^^^ 

^0 protein, It may be pretera ydn, cyclosporin A, FK50b 

immunosuppressant drug, reactions. .^^^^^^^^ 

foregoing or other compound wh>ch r p ^.^^ ^^s Jected as having 

Ceilswhichhavebeen-d^ed ^^^^^^^^^^ 
may be grown in culture unde sdec ^^^^^^ '^l^^^l. the host cells 

^'^^ s ioTe production ot the desired ^^^^^^'^f^,,,, culture or 

ar,d/or assays lo the p pi^r^ned, e.g. 9 

have been identrfied, they m y 

introduced into a ^^ost o^an- ,3 ,,dogenous gene o it ^ ^^^.^^.^^^ 

""^r^tranrroneormoreotherreg-,^^^^^^^^^^^ 
engineered, the P^°"^°^^;;" ,,<-,caHy recognized by the DN 

lusion protein ot this mveni ^ent manner. Such ^ 

.cognizedandregulatedinagad^.^^^^^^^^ 

3, uselul in situations ' "J^. ,,,3, in one embodiment, one or m ^^^^ ^ 
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ideally with the expression control 

OHA.n.ngdo.a,n Which, cap^^^^^ 

element introduced ,nto the target gene^ ^''^ 50^ 

Oogenous gene .s mod.ed v,vo , ..^^^ , ,37) Ce, U-Oo. 

Known ,n the art. and j,,,er e, ai. d 989) ^^--^^^^^^^^^^ 

Mansour et al. (1988) ^^^^^^^^se BNA or a ribozyr^e or ot er RN ^ 
, -get gene n^a -^^^^^^ , tnvent on c euse^ ^ 

,,,scnptional activators prov^^^^ ^ ^^^^ ^^^^^ .^^ 

e q., antisense BNA, are p „ i^rlude biological research. „,n-od^-247 

■A^c^ hM a DNA-binding domain, n separate tusion v 

a„d « P'"""',!'' J^olpson factor (s^e FIG. D- » , ,„ a 

— -^^^^^^^ 

25 (SEAP) ora P domain. The amoun ^^^.^^ ^^^^^,^3 

recognized by the uiNA will reflect the extent to whicn v ,,. „ ri„rnains of this 

amount d gene product P-^u-^;^^^^^ ,^ „,e 8. use ot the 

30 sersilivityol ,„ a .ariety ol apptolW^ 

«e„*n8 peP«* <^*""'- g.nes 

protein such as an RLS tuston p 
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constructs (beanng the cDNA ,ns rts) are n ^ ^^^^^ ^^^^^.^ 

subsequently made to cor^tain) (0 a - ac^ en 9 ^ ^^^^^^^ ^^^^ ^^,3,,,, 
ONAbld,ngdon.a,nandatargetdor^ 0'^^^^^^^^^^ 

cor.tain,ng a recognitor, sequence - 1^^^^^^^^^^^^^^^^ ,^,,,,^,e responsible lor a 
which encodes a detectable gene product or ^^^.^.^^ ^ ,DNA-encoded 

detectable pbeno.pe. Cells -P-^^^^^^^^^ reporter gene construct. 

domain which binds to the target domam ol n ^^^^ p^^,^,, , 

rcorrespondingcONAcanthus^be|d— 

encodes b,nds to the target ^o-^'^'^^^^^^^ .^NAs which are expressed at lower 

a.,,tvtodetectand.e.^^^^^^^^^^ 
levels relative to other cONAs, GUN 

Another such application mvolves assays ^^^^^^^ ,^3,, 

second lusion protein containing a tr^"=="P^^^^^^ .^ein domain of .nterest. The 

econd protein domain of interest wh,ch Because the two tus,on 

ce also contain a reporter gene -nstruc - ^^^^^^^^^^^^^^^ ^ 3,,, may be 

;linsbindtooneanother.therepo^^^^^^^^^^^ 

,3 usedtoidenti^compoundsw^^^^^^^^^^^^ 

drug screening program. Thus, ceiis The presence or amount ot repo 

onLtedwithoneormorecompou« 

qene product is then measured. A ^'"'^'''^ J less or none of the substance, 
.stance, as compared to -P-s- -^^^^^^ Por additional data s 

30 indicates that the substance ^-^^'^"^^"^^^^ can be adapted to thrs ,nven. n. 
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Tons & Thoir Administration to Subjects Containing 
Pharmaceutical Compositions & Tnoir a 

Engineered Cells 

Administration . ,^ h, ,man or non-human subjec: using 

,.gand .a, be j'/^j;;, ,,,,„.„a„o„. Va„ous 

pMn™c«u»=a.v acceplable M.erials and m* ^^^^ 
,„™uu.».ns, .ou.es o. adn,nis,ra„o„, ""^^J^^^;^,^ 3«in.^ „, ,he .,gand .0, 

,,c igand binding domain, >he chace o. H ^ ^^^^^^ 

cmcumstances of the recipient, the respons ^, ,,,,3, gene 

..oavailability of the ligand, the biological ^^^l^^sl, etc. The drug may be 
p^oduct. the number and location 0. engine ed s e , 

administered parenterally, or more pre e ab - Do g ^^^^^ 
administration w.i. depend upon ; ^.^ected intravascularly. 

orally as a pill, powder, or ^^P--;^^; Agonists, as discussed 

intraperitoneally, subcu.aneously; 0 e -^^^^^ ^^^^ .^^^ ^^^^^ art 

t^elow) may be formulated ^^-9 ,,33 ,nd particular method of 

,or the various routes of ^YJ^l .nd be determined by the attending 

administration willdependupontheab^^^^^^^ 

physician or healthcare ^^^^^'^^^^^^^^^^^^ ,,g,,erize the fusion proteins of this 

response is monitored so Iha. ffie level of <>^^^^^[^^ ^ ,„g „„ „e expression 
,,ge, ,ene expression ,e.e, Response, one ma, adius. ,ne 

,e,els observed d™g me trrm penod ar,d «« „, ,„ „,hen,ise 

,eve, 0, s*se„uen. dosing .0 ai>er ^'-■^^^'^l^^:;:::,,,,,^^, ,epea,ed - .be 
,„ improve ».e .herapeutc response Tb,s P'°=«= » ,3 ,„ ,,„„aered 

dosage is op.i™.ed lor ^^^^ ^^^^ ten Lnoined. one ma, conduc. 

obronioally, once a rronienanoe <^^^-^'^J2,^^^ ,sponse ooniinues .0 

periodic Mlow.,.p monitoring to assure that ttte overan 

beacbieved. ,h.dn,n is to be reversed, administration of drug 

,„ the even, tbat the aotivatron t=v '^"^^T^ to elieo, a more 

ma, be suspended so tbat oeils retum ,0 = J ,,„,„„e,ed. An antagonist is 
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.usion proteins. Such compounds bind to the fusion prote.s bu do - ^ ^ ^'^^ 
of the fusion proteins as is required for activation of signaling. Thus, ,n the cas of an 
adverse reaction or the desire to terminate the therapeutic effect, an antagonist can be 
administered in any convenient way, particularly intravascularly or by 
inhalation/nebulization, if a rapid reversal is desired. 



""'"'"DrugsTi.e., the ligands) for use in this invention can exist in free form or. where 

• ■ f.rm ThP nreoaration of a wide variety of pharmaceutically acceptable 

compounds include .he oonvemiona, non..ox,o sa«s o, ,he dua,e,na,, „ , 
such compounds which are lomned, lo, example, torn inorgan^ o, organ o ao.*ol bases- 

The drugs may (orm hydrales o, solva.es. I. is known .0 .hose of sk. .n the art .ha. 
Charged ImZnds .on. h,*a.ed species when lyophillzed wilh wa.e. o, om, solva,ed 
™rie, when ooncen.ra.ed in a soMion wi.h an appT0pha.e organic solvent 

' ThX -n also bo adminis.ered as pham«oeu«cal composi.k,ns compnsing a 
,herapeu«cally (or prophylaCicaly) elfeCive amoun. ol .he dmg, and a Pt«'™^'=«"^"» 

eUion.. earners include e.g. saline. h*red ' 
alycerol erhanol and oombinaUons .hereof, and are discussed .n greaB de« below. The 

r;!;Cdes,red,canalsocon,ain^noramoun,so.we.ingoren.ls,.,,nga^^^^^^ 

„H buflenna aqenrs The composilion can be a liquid sofufion, suspension, emulsion 
able *i Xs e, sus«ined'elease fom.u,a,ton. or powder. The composKron can be 
fc m la.ed as a supposLory, wi.h „adi«ona, binders and careers such as .nglycen es. Oral 

I ,ion can Jude s.andard earners such as pharmacurica, grades of mann.o^ 
,a«e,s,arch,n»gnes,ums.eara.e, sodium sac*a.ne,cellulose^ag^^^^^^^ 
etc. Fomulaton may involve mixing, granulatng and compressing or dissolving me 
innrpdiPnts as aDDropriate to the desired preparation. 

' te^alLical carrier employed ma, be. fo, example, eifher a solid or iK,u,d. 
Ite.ra6ve solid carrieis Include lactose, .erra alba, sucrose, «=, gelaSn, agar, peotn, 
aca.a~ms.eara.e,stea.o add and *elike.AsolM earner can in*de one »^^^ 

rbs^nce! Which may also ao. as flavoring agen.s. lubncams. "^^'-^l;^^^ 
agenls, fillers, glidams, compression aids, binders or .ab,e..disin.egra.ing agen^ 

Lthyl cellulose, sodium carboxymethyl cellulose, polyvinylpyrrolid.ne. low melting waxes 

■) and ion exchange resins. 
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c.-,e. «u* ...... * « 

ca,„e,s ar. used in p,epa™,g -'"""^ X ^^n ba dissolved o, suspended in a 
o,esau,izad composiSons. Tl,e aC.e '"5'=*'" ^„ „ a n,«re o, 

'::Lced,,ca,,,accep,aP,ai«d,d^^^^^^^^^^^^^^^^ 

p„a,maceu..cai add«™es s«n as ^ ,,en,s, coio.s, v,s=.s., 

lee,ene,s. „a«nn. agenis, ^'-^^^''^^2<,,^,^'<>^ o, i,c,d,d ca,ne,s io, o.a. and 
,eguia.».s. s,abii,ze,s ., ;Lia,,, con,a,nin9 ad*li«s as aPove, e.c 

„a,enie,ai adminis«a.ion -ncidde «f ' * , ,,„„,„se soidiipn), aicohols 
03 ulose denvauves. p.eferaPI, sodium ,3, ,nei, den.aiwes, 

*1, -no.,dn= aicePois adn,.nis„*„, .no 

' nd o»= (e* <««"«^ '«»™' t 2, r2 and soprop,i n„ns,a.e, S.e* i jd 
,a,nercanaisobeanoi,,es>e^ucM^»^^^^^^^^^^^ 

carders are uselul in slenle iiqurf tonri co^ ,,<j,o«i,Pon or o.ner 

, car,ie,lorp,essunzed=praposi»ns-«^^^^^^^ 

20 form. .,,,„.e time delay material well known to the art, such 

The carrier or excipient may ^^^^^ a wax, ethylcellulose, 

as glyceryl monostearate or glyceryl ^-^^^^^^^ ,,3. when formulated for oral 
Mroxypropyln^ethylcellulose rne^^^^^^^^ 

/wide variety ofpharmaceutica forms^^^^^^^^ 
preparation can be tableted, placed >n a ard gel^ P ^.^^^^ 
;efLotatrocheor,ozenge.Thean.ou^^^ 
3, preterablywillbelromaboutasm^^^^^^^^^^^^ 

will be in the form ot a syrup, ^ ^ suspension. 

Suspension in an ampule or vial or nonaqueous hqu ,,,ep,able salt of 

' to Obtain a stable water soluble dosag form a pha ^ .^^ ^^^^ 

.edrugmaybedissolvedinanaqueouss^^^^^^^ 

3, 0.3M solution ot succinic acid or c>t^ ac^- A - ^.^^^^^^ ,,,pound ,s 

in suitable basic solutions. If a soluble salt form ^^^^ ^^^^^y, 

Isolved in a suitable cosolvent or --^-^^^^^ ,,,,,33 of such suitable 

dissolved in a suitable cosolvent or c -b.n^on ^^^^^^ ^^^^^^^^^^ , 



40 

60 



PCT/US9H/n723 

„.„„„, 0-60% of fcwal 

,„e.a,»s,o,».a»s.e,eo,,— 

adminislrauon to a patiem ^^t^adermal, intramuscular, mtrap ^^3, 

subcutat^eous, intranasal, p convenient or 0*^^'^ 

or local. For ex vivo appl-cation , ^^^^ ^^^^^^ 

cellular composition. composition is iormulated .n ^i^^ to 

^,*«9 a<,», and a „ rtxed ,oge.«, .n - dos^ 

containing sterile pna^--^^^^^^^^^^^ 

administered ^V^^^^^^,^ ^,ed prior to a^--^^^;^ ^,,p,,nd may be disposec. 

-^^^^rrdir:-^^^ 

...indevlcesplacedupo..^^^^^^^^^ 
and injections which release the 
30 release mechanisms^ 

Materials and methods w v invention. See e.g. u 

,„a„d™,.eadap.ed«P,.«.n^^^^^^ 

wavanous lotmdWns) and a P „ 3a„»t,a»n) and 

(paWishsd Apnl «, <ffl5. ■ P ^.^^ ,„ ,n, mnge 0 

5„„,.gs,pre1««»^;^,^„3^^3ana*..he»n,^^^^^^^ 
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con*e. Ii« ^ »' 1 ;Jco„.ai.er(s) can t« a no..ce | « 

® « ® 
..„...o,— ^^^^^^^^ 

ThP following examples contain invention in its various emu 

the equivalents thereof. The ^^^'^f,, . ^.^_hout this document, the >nvem. 

2nd Ed., ed. by SambrooK, ^^^^^ _ ^gss), y uvhridization 

nwACloninq, Volumes land lUU. I , roo 195- Nucleic Acid Hybnaizd 

(B.D.Hames&S.J.Higg>nseds.198 ), 
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s J Hiqgins eds. 1984); culture Of Anima Ce,!^^^^^^^ Perbal, A PracUcal Gu.de To 

A0.*mlc press. London 1987)^H ^ ,he Mouse Embn,o, 

; ,0. «, *i, and fspnna Har.o,, N.V., 

SpnngHartorLabotalon, Press, L, 



S stein B.& Greene, W.C. The w 

"""■""p «,d. USA. 

, sL (1996) controlling protein associauon 
p , HO S N Crabtree, G.R. & Schre.ber, S.L I ' .^.^n o( proteins. 

' , ^ ori qci 93 4604-4607. 

Proc. Natl. Acad. Sci. ao, 

,„ OR (1994) Mutational analysis of the 
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. u nq85^ Cell 43, 729-736. 

of the VP16 activation domam. 
. c, (1990) critical structural elements of the VP 

Cress,W.D.&Triezenberg,S.J.(1990) 

3, science 251. 87-90. . .^Pl 6 transcriptional 

.,H.CarevM.0992)Asvner.sticincreaseo,a— dVP16 
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^ OK (1989) Nature 340, 245-246. 
Fields, Song, 0-K. 0-^*^ ) 

n n«iv lor protein- protein 
o,ia94)T.,e two-hybrid syster.: an assay 10, 

F.elds,S.&Sternglanz,R.n-J^) 

„—...-- 

, Rrsconi S.&Schaliner,W. 

Transcriptional actuation modula.ec by 

253. 808-811 ■ 

c, h: K (1085) cell 46, 885-894. 

. M (1986) science 231, 699-704. 
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activation don^ain upon binding to a human TAF. Scenes 
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M 7 11992) Expression cloning ol a n ^^^^^ pcGNN 

sp L 'e,«n,. MO,, Ca... Bio, , , ; cMV enmnco, anO pron^o.e, 

^333,bled by stepwise -sert-n o Xb ^,,,,,i,,ons are as foliows. 

vectors. The individual components used a 

. . r^MA hinriina domain ir 
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• nh.Gal4DNA binding domain into 
nrGNN-GF2 was made by insertion ot ^ FKBP domains. 

PCGNN-GAL4 expression plasmid. PCbN . ^^^^^^^^^ 

R^mm digested DNA fragments ^^"^^'■^^.f,'? '° ^cGNN-MAMTS were made by 
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Example 2: Generation of ^^^^^'^^^ ^"f^ ,^.s,^p .ep^^er stably -ntegrated, 
To generate cells containing the PLH-5xC^^L4-.L^ ^gg_^^ 
helpeMree retrovirus, generated as ^" '^^^ ..^.stan, clones 

was used to ,n<ect HT1080 cells, ^-dreds o om - ( g^^^^^ ^^^^^^^^^^^^ 

.ere pooled (HT1080 B pool) -^^'^'^^^^ tor further analys.s. 

with pCG-GS. The most responsive clone, HT1080B, was 

Example 3: Transient Transf ections ^^^^^^^ 
HT1 080 cells were grown at 370 C ,n MEM -d'um "9 
essentia,aminoacidsandpen.cillin-streptomyc..Tw t^f^u^r^^^^^ 

approximately 2 x 1 05 cells were seeded ,n ^^^' ^^^o^^ , ,,,, well 

Instected using Upo.ectam,neas — 

received the amounts plasmds >nd,cated m the f.gure ^ 
p,asmid,withthetota,amountoiOHA..^^^^^^^ 

experiments shown ,n Fig. 5, 10 ng of plasm.a j y ^^sions were 

and increasing amounts of P'--d -P-ss^g 6^^ ^^^^^onj ^ ^^^^^ ^^^.^^^ 

included. Aftertransfed,onforf.ve hrs, the m«a -^^^^^ 

added. 1 B-24 hrs later. 1 00 ul -^'^ -^'Tnlx^^^^^^^ and 450 nm emission, 
a Luminescence Spectrometer (Perk.n Elmer) at 350 nm exata 
Where indicated. 2-5 ul of medium was also assayed for hGH prote.n 

(Nichols Diagnostic). 

FKBP12 and the olher conlaong a <^"=»"P™ e, ,,1). In Ihe 

subunu d NF.kB) lused io .he FBB doma« "'f^^s " a«iniV complex «* FKBP 
„ presence ol tne natural-product rapan^an. whch Ms a 9 ^^^^ ^^^^ 

.1,^ COR nRS fusion Drotem is efficiently recruueu lu 
and FRB domains, the FRB-p65 fus.or^ p ^^.^^^ ^, p65 

fusion protein. This basic system results m the dehvj o ^^^^ ^^^^^^ 

activation domain per DNA binding domain ^ ' ^^^-..g ^..p.e FKBP 

activation domains delivered to the promoter can b ncr a y ^^^^^^^^^ 

^^^^ 

proteins and deliver them to FKBP moieties linked to he GA ^^^^^^^^^ 
Lmple, the addition of a tetramerization domain pr^^^^^^^^^^^^^^ 
,0 between the FRB and activation domains should generate 
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domains bound to .ha promoter ,^ ^^.^ „ 

To tesl now bundled acwation domain wsion p ,„3io„ 
„ans,ec,ed HT,080 B .a,,s wl.b plasnids do™,ns ,o ,be 

p,o,eins and treated .he ce.s w»h ,0 ''^^^^^l^'^^l,,,,,^ , deUvered .0 each 
Uo.e.^Weobse™ed.na,wheno„,o.RS^^^^^^^^^^^ 

GAU monomer (GFU RS and SF'*"^^^ bund ,„^.„„ 
induced,he,epo«e.ge„e«^^^^^ 

p,d.eins. This (indng suggests ina. bund p„,j„, 
.neir abiiny » deliver more ^^^^^^^^^ .,,„„3 oonrbinafions o, DNA 

Wucers dl uanscrlpson. ''"^'^'^'-^^'^^'l^Zi^ns revealed Iha. «re level ol 
binding .usion P--^- "I"^^^^^ o. ac.iva,io„ domains .ha.can 
r,:=l=merbound.o,.sP^^^^^^^ 

The BLS fusion pro.ein is capable of '"'^'^^ ™; fusion protein 

domain to the promote, ,han i.s <i*er same 

, conraining four «dem^ rei.e,a,ed p65 « ,Hou,d have similar 
number of ac«va,ion domains ,o .he promo er ^^^^^ ^ ,„ „lS 

.ransacvatlon capaclt,. To examine -^^^^''l"^Z.^,'> expression piasmids 
„ the rapamycn regulated ^-^^'^f " ^ - BS4 or RLS fu^on proteins into 
enoodlng DNA binding receptor, GR . J" g^ne b, adding 10 

„ HTtOSO B ceiis and analyzed '^^^^^ ^^^l^,,, I repoher gene strongly 

„M rapamycin to ''-J^^^'^^'^^,^ Z Gf .and BS4 combination of fusion 
in cells expressing the GF1 and ML5 ,:„„ rtnmalns are weak inducers of 

protams, indicating tha, .he '^^f '^^7*: "aU "as ab„ to induce 
transcription in ,he dimedze, sysrem (F'S'^^l- " ~""» ' ^ „. fusion 

3, reporter gene exprest»o„ in the pn^sence ^^^^^^l^^^,^ o. protons. Without 
proteins, alblet at much lower levels than the GF, H 
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,e,ow .eve. of deteC.on, w.lch .ay ^ ^^^^^^^^^^ 3.a,egy, un..e 

uanscnpuon. These observat.ons ^^^^^^^o^^^^^^ ,ess tox.c to ce..s. 
.e.era„on, generates h,ghly potent act'-t-o o-a. th ^^^^^^^.^^^ 

one possible explanation for part or a,, of the ro st 
,H.Sf.s.on proteins. thatt^clo.^ 

produces an avidity effect. To test th,s we ^ ^^^^^ ^^^^^^ ^, LS 

Lrv. co-expresslng a limited ^^"^ ^^^''J^^^g'^^.^.^ ,,,,,ning, at most, a single 
„s.on protein should promote the ^-^"^^'^s oZu m the number of FRB domains in the 
FRB domain. To examine the ^-^^^'^'''''' ^^^^ HT1080 B cells with relevant 
H,S bundle on reporter gene -P^" .sponsive gene in the 

expression p.asmids and -^'^^ f^^^ ""^Veviously obsen^ed (see Fig. 2A). 
presence of 10 nM rapamycin .n ^^^ 'Z^preL^^ in cells expressing GF1 
apamycin induced only .ow levels of ^^P^^^^^^^tression was very robust in cells 
and RS fusion proteins. However. in cells expressing 

expressing GF1 and RLS fusion P- ^^^^^^^^^^^^^^^ J ^^^^^^ ^..^eed 

GF1 , a limited amount of RLS and ^J^^^^^^^^^^^^ Jeved by GF1 and RLS 
reponergeneexpress^ntoeven^^^^^^^^ 

iusion proteins alone (F.g. 3B). This ^^99 ^^^^^^^ ^^,i,p,e FRB 
expression by RLS fusion Protein^^^ 

domains in the bundle. Indeed, the data ^^ow" • activate gene 

FRB domains in RLS fusion ^.^^^^^^^^^^^^^^^^ ,,pUcin allows multiple FRB 

, expression to the maximum possibl leveUt ,ke^ P ^^^^^^^ 

BUS ,usio„ protein, we expressed l''^'"^^^°^^:;J^„ op, and induced 
aclds«-340) ,el*e,o RUS, '^'^''^ ^'ZciZit<rm6:«r,. In «s situation, the 
,epoder gene exptession by adding tO nM ^„ and one 

33 ,eLe„c.und,es.on,«d— 

FRB domain. Because teducng tlte ;„ ,„e presence of 

.epone, gene expression, ^7 t'^^ ^^^^^^^^ in the nontOer 0, ac.i.ation 
excess U region relative to RLS can be — ° „, a ponion o, 
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. ihp recombinant 

inducers of transcription pnmanlybecus 
activation domains to the promoter. 

and synergizing activation domams ^^^^^^ ^^,^^3 ,3 ^NA bind.ng 

The experiments described ^ 3,30 containing the FRB and activat 
domain) as the bundling domain in 'usmo Prote, a ^^^^^^^ ^^^^^^^^^ ,,,,a,ns 

c in addition to the tetramenzat.on domain, tni p ^^e,her the 

Imerizafion domain ot lactose repress -^.^e ^^^^^^^ ^^^^^^^^^ -ding se,uen es 

, we made an expression P>--^^' ".^ .as replaced with a thirty-six ammo aoid 
(..inoacids.6.360)intheB^^^^^^^^^^^^^ 

region between ammo acids 324 and . ^^^^ combination of p65 and 

portion of t^pstream linRer region (-^"n J^^^;,^,,^^ ^^"^^^"^'Il^'rie? 
activation domains when fused to GAL4 DNA 9 ^^^^ .^^^led together 

. ca^^ing a stably integrated GAL4 respons-- -po^e j ^^^^ ^^^^^ 

3.ong stimulation of ^^^^^^^ domain suggest that the minima, 

3, the lactose repressor ,,sion proteins, 

tetramerization domain IS sufficient TO 
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threshold ..mbcr of actWators required to 
7. Bundling reduces the thresn 

Ic Lors required ior robust expression oUhe P 

and two-fold higher levels .n ' as compared to GF4.RS ius 0 P 

combination of fusion prote.ns. ^^^J^ the lowest a-ounte of RM 

amount of the express.on plasm.d m the P 

activators by adding va-v-nQ ^"^"^"^ °' ' r^TS fusion proteins >nduced he rep 
;;.ONAbindingrecepfor-^B^^^^^^^^^^^ 

gene expression robustly at nM pa y ^^^^^^ ^^^^^^ ^^^^^^^ ^Z^^^e.^ 
3, rapamycinintheme.^^^^^^^^^^^^^ 

reporter gene s,gn,f.cantly abo ^^^^^ ^"P""^^ ' lining multiple 

,evels of reporter 9ene exp^ss 0 " ^ ^^^^^^^ ^, activators or^. ' 0 ^^^^ ^^^^ 

SB), collectively, the ^^^^^ strongly induce gene express.on 

bundled activation domains are su 
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«. Bundling activation domam fus.on p 
Examples, tiunu a In the presence 

in some cases, cell deatn^ ^^^^^ both J 

.ybnd proteins iinding each othe^ .3 p^sent at relat. 

mammalian two nyui >u 5143. i ne ^ 

n-Cbl ptoto-oncogene hav9 bu«n sn assays. However, i 

combinations of ^xp^ess-on P ^^^^ ^^^^^ e SH3 VP 

observed that neither GCBL alone ^^^^^^^^^ ^^^^^^ d he eporter gene 

induced the reporter gene express - ^^3, " t^ai usion rotein 

«p,essl«n « .0 .hei; " J * western blo. sbown ,n ■ aC 

,,sa>es,,o..be.— ^^^^^^^ 

.wslrales thai me ' „ j,„„o,e<J oobrte,parts, SH3 LVP 

preseatalhSforamourtsthaUbe 
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wacellular levels of .he ac»va» boma " ,„^o p ^^^^^ 
„ probed with GAL4 a*od, «e w « e .0 « ^^^^ ^ 

3u99e*9 ma. .us,o„ P«,n ,s .0 = » ce^ , 
»d.„9 oompopen. (6CBL) ,s ~„;,e„„g a spffiolon. numbe, 0. aouva.iop 

a*a,i0P don... Iu»n prceios -^J^^^^^^ 0. « repoPer gene .0 oocpn TaKeP 

rp=r:ro::rpo=:=a.,e.:o.,e.e,3.p.peo^^^^ 

-'C:-ed.P.bea.w,,Ue».P.--— 

—dCrti— a=^^^^^ 
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Claims: 

,.,c -CM encoang =. toon p,o,e,n con.aHn, » domain and 

, ,,e.co..-«,e,cac,ao,aa,.U*,e,n.ne.n«».a,n:sa*«..^^^^^ 
i„ain...menza.,o„do™ino,...ran,.nza«ondo™., 

, . „, H,„ 2 .herein ,he bundling domain is or is darrvad 



zipper domain. 

, , -,.= 1 ■^wherein the heterologous domain IS 
4 The recombinant r^uCeic acid of any of claims 1-3 where. 

a transcription activation domain, 
a transcription repression domain. 

a DNA binding domain. 

, , -^d wherein the heterologous domain IS 
The recombinant nucleic acid of any of claims 1 -3 wherein 

a ligand binding domain. 

^ r^Rc; VP16 or AP domain, 
derived from a p65, vt- id ui ^ 

, ™reoom.,nan,n.,eioaoido,o,.msw^— ^ 
derived trom a KRAB domain or a ssn-6/TUP 1 or K 

K an, nucleic acid ol Cairn 6 wherein .he heterologous domain is or is 

:L::rat:^:-r„mpos,.eo^^^^^^^^ 

,,,.,ac„mhinan,noc,eicao.o,c,a...^^^^^^^^^^^^^ 
derived from an immunophilin, cyclophil.n, FRB, 
domain. 

, i.irn 1 1 Wherein the heterologous domain is or is 
,2. The recombinant nucleic ac.d of ^^^^^^^^^^^^ ,,,eptor. 

derived from FKBP. tetR, progesterone receptor ecdy 
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, I 2 3 4 6 8 or 10 wherein the fusion 
13. The recor.b,nant nucleic acd of any of cla ' ; . domain and a. 

protein corr^poses a bundling donaain, at Last 
leasione DNA binding domain. 

I ■ 19 3 4 7 8 11 or 12 wherein the 

and at least one ligand binding domain. 

< 1 s1''359''''^ or 12 wherein ttie 

and at least one ligand binding domain. 

I ■ 1 2 3 6 7 10 11 or 12 wherein the 
n 6. The recombinant nucie.c acid of any of daims . ^ ■ ^^^^.^ 
fusion protein comprises a bundling domain, at least 
one ligand binding domain. 

fH ■ s1 2 3 4 6 7 8, 10, 11 or 12 encoding 

:;r;rr:::tx ^^^^^^^ 

activation domain and a DNA binding doma.n. 

, ■ -H nf rlaim 14 wherein the fusion protein contains a lac 
, 8. The recombinant nucleic aad o o^^^^J ,,,,, one p65 

repressor tetramerization domain, at least one 
transcription activation domain. 

g. 31417 or 18 wherein the fusion 

Which contains one or more of the mutations of figure 7. 

^ina a fusion protein containing at least one domain 
20. A re«>mbinant nudeic acid encod ng a fusion p ^^^^.^ ^^.^^ 

mutations of figure 7. 

. , , ,.90 wherein the heterologous domain is a ligand- 
21 . THe recombinant nudeic acd of daim 20 wherein 
binding domain. 

, . H of claim 21 wherein the ligand-binding domain is or is 
22 The recombinant nucleic acd of da.m 21 w 
derived from an FKBP.cydophilin or FRB domain. 
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. -.oiCm 21 wherein the ligand-binding domain is or. 

23 The recor^binant nucleic acd of claim 21 

denvedfromatetRdonaain. 

24 The recomDinant nucleic 
denvedirom a hon^one receptor domain, 

. -.o' claim 24 wherein the horn.or.e receptor domain IS a 

25. The recombinant nucleic acoo, claim 2 

steroid receptor domain. 

. ■.o1claim20whereintheheterologousdomain,saDMA 

26. The recombinant nucleic acdot claim 20 

hindina domain. . 

is derived from a GALA, lex h 

. .oded by the recombinant nucleic acid of any of claims 1-27. 
28. A fusion protein encoded by tne 

domainandiONAbindingdomam. 

do:^„anda«.ona*aUondo™... 
,a, a fet nucteic aad enc<«a ^„,„,j„„ domain 

t:::«ao^«no.o™."- 

. «pq 31 which further comprises a target 

3--"-r::r::-:— ^^^^ 

gene operatives linked to an expre 

33. A nucleic acid composition comprising 
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ligand binding domain, a itci 
expression conlrol sequence. 

expression control sequence. 

, ,.^32 or 33 Which further comprises a nucleic aod 

donnain. . ^^.^ 

transcription activation 

3,„.to,c^pn.«an.cieicaci.co.osi.ooo.a„,o,c,.i..a.3. 

, 40 wt.e,ein tha vacto, is saiected t.om «^-J^^;::^,°»s. 

:u::-:-v:aU,-o*a,.c,ots,.^.ioa*„o«.sM 

vectors. . 

,,,„«o,4,.«nis..*„pac.9a.in.ot=oo.«— ■ 
42. The vector of claim 40 or A 1 



30 or 31 
and 
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• • n.mrnetqene construct comprising a 
. ,hP second virus additionally comprises a 



activation domain. .omo rises a nucleic 

,.nscriptionac.vationdoma,n. „„3.3 a tKird recombinant virus 

.ecompositionotclaim.^^^^^^^^^ 
containinganudeicacidenoodingafusionp 

transcription activation domain. 

• «ihird recombinant virus 

containing a nucleic aoiu 
transcription activation domain. 

o third recombinant virus 
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. • ^1 wherein the recombinant virus is selected 

, .1 or.,/ r^fthp compositions of claims 4 J, ^t.'*". 
by introducing into the cell any ot the connp 

. , 53 or 54 wherein the compositions are introduced ex vivo. 

55. The method ot claim 53 or wnei 

K H f., aim 53 or 54 wherein the compositions are introduced in vivo, 

56, The method of claim bc5 or w., 

57 A host cell containing a nucleic acid of any of claims 1-27. 
59 A host cell containing a nucleic acid composition of claim 34 or 36. 

61 . A host cell containing a composition of any of claims 45, 47 or 50. 
62 A host cell prepared by the method of any of claims 53-56. 

es The ™ihod 01 *im 64 wherei. *e orsartsm is a mammaL 



66 
primate. 

67. The 



method of claim 66 wherein the pnmate is a human. 
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K-.ont nNA seauence comprising a first portion 
es. A DNA vector contaNr^g a ^^^^^ add.iona. domain that is 

::::: 

DNA sequence of interest. 

A cell conta,n,ng recombinant nucleic aads encoding 

a.st,usionproteincompns,nga.nd.^^^^^^^^^^^ 

« and one member o, a P^P^^^^^^^^^^^^^^^ ^nd tbe otber 

(b) a second fusion protein compnsmg a DNA bind g 

member of the peptide binding pair. 

,,„j:ra.c:pLo.-n.,o.He^^^^^^^ 

„,e,eip me cs. farther conBns a f " ,,,„aatioP ol .he wo 

con.,0, stance which pe.mi,= reporter gerre e«prass,o 

fusion proteins. 

70 A genetically engineered host cell which ^^"P"^^^^^^^ ^„,,ol element, 

(a) a reporter gene linked to a regulatable expr es ^ 
a^strecombinantn..^^ 

binding domain linked to a protein a ^.^^^^ ^ 

„ A 9ena.te», epghaered rtos. call which „„^„ eiemen., 

s :xrr:rrirc:ra:-p-r"^^^^ 

i::r^h,in.a..oapro,e,rt.— .-^^^^^ 

linKeoioaii H . i,»n«-iiolion activation doniam 
borrfing domarn and atrahscnplio" „e. 
„.erain association o, the tuslon proteins activates axprassron 

n. A geneticait, engineered "'^^^^^^^Z^^^^ oontroi eienren, 

,a) a reporter gene iinked to a ^ ;„ „„pKsing a 

'^r:S:r=h«^-P-ihd«rnaino.,«eres.a„d 
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traascriptioa activation domain ''"ked ^° ^^^^ ^ ,est library 

interest which comprises the steps. members of a 



library. 
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